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This report analyzes data obtained as part of the Skylab 

program, specifically the earth resources component of that 

program. Data was obtained over the Michigna test area on 

June 12, 1973; August 5, 1973 and September 18, 1973. Sensor 

imagery from the S-190A, S-190B and S-192 was utilized in the 

study and compared with ground truth obtained by the PI. 

The statement of work for Contract NAS9-13332 provided: 

"The PI shall apply the identified information 
extraction techniques to the Skylab and aircraft 
data for crop discrimination, mensuration and 
analysis results for cost effectiveness and 
accuracy using ground truth and the agriculture 
statistical reporting service as a data base. 

The efforts are to include specifically: 

2. 1.3.1 The PI shall by photointerpretation 
analyze EREP imagery to construct basic land 
use and crop maps. 

2. 1.3. 2 The PI will investigate digital tech- 
niques to discriminate crops and characteristic 
signatures which indicate crop health and vigor. 

2. 1.3. 3 The PI shall differentiate crops within 
a resolution cell by appropriate digital analysis 
techniques. 

2. 1.3. 4 The PI shall compare his crop discrim- 
ination, mensuration and predicted yields with 
the agriculture statistical reporting service 
for accuracy. 

2. 1.3. 5 The PI shall make a cost effectiveness 
study to determine the feasibility of using remote 
sensing for crop inventories and mensurations. 

This report leads to the following general conclusions 

which are more fully developed in the text of the report. 

Analytical techniques and the rationale leading to the con- 


i 


elusions are also in the report text. 

Photointerpretation of S-190A and S-190B Imagery showed 
significantly better resolution with the S-190B system. A 
small tendency to underestimate acreage was observed. This 
averaged 6 percent and varied with field size. Fields of less 
than 10 acres were estimated as larger than actual. As long 
as fields are greater than 5 acres in size the resolution of 
the S-190B system is adequate. 

The S-19QB system had adequate resolution for acreage 
measurement but the color film did not provide adequate con- 
trast to allow detailed classification of ground cover from 
Imagery of a single date. In total 78 percent of the fields 
were correctly classified but with 56 percent correct for the 
major crop, corn. Part of this difficulty can be attributed 
to the existence of dual signatures for corn on this date. 

Acreage measurement is more critical to this study. Use 
of S-190A imagery in the June 12, 197.3 imagery resulted in 
91 to 95 percent accuracy in estimating acreage of hare soil, 
forest, grass, forage, grain, etc. categories after a ratio 
correction of 25 percent for underestimation was applied. 

Analysis of S-192 was conducted by a subcontractor, 
E.R.I.M. These data had been acquired on August 5, 1973 at 
15:02 GMT. The crop recognition, accuracy which was achieved 
during this investigation was shown to he related to the amount 
of data available for training the computer. Accuracy in- 
creased as 10, 20 and then 40 sections were made available for 


extracting training statistics via a supervised clustering 
approach. Even with 40 sections available for training, 
however, the average absolute accuracy of roughly 70 percent 
for 5 classes was somewhat disappointing. These relatively 
low values were attributed to: 

1) the data were gathered at a non-optimum time in 
early August when corn, and other crops were quite 
variable in their state of maturity, 

2) the atmospheric conditions over the test site were 
fairly hazy thereby reducing available contrast, 

3) the data gathered by the S-192 had significant defi- 
ciencies with regard to signal-to-noise ratio 

in some bands, the dynamic range covered by the 
signals, and channel-to-channel spatial registra- 
tion. 

Attempts to discriminate for health and vigor using S-.190A, 
S-190B and S-192 imagery all proved unsuccessful. This finding 
is consistent with the difficulty in obtaining accurate acreage 
estimation which is, of course, much easier. 

A mixtures classifier was applied in an attempt to in- 

; > f 

crease the accuracy of crop classification of pixels in the 
S-192 data. The error rate was slightly larger using the 
mixtures classifier than it was with the linear classifier. 
Surprisingly, only 18 percent of the pixels were classified 
as mixtures. Given the field and pixel sizes many more mix- 
tures had been expected. 
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The statistical Reporting Service (SRS) of the U.S.D.A. 
currently publishes estimates of crop acreage for the U.S. 
and major producing states. Their probability based surveys 
cover about 150 commodities including crops, fruits, nuts, 
livestock and poultry. Their accuracy in crop acreage esti- 
mation is a function of the total expenditures for the survey 
as well as the proportion of the acreage in that crop. At 
current cost levels their estimates of national aggregates 
are more accurate than those obtained from the Skylab data 
set. Therefore, the SRS estimates are more accurate than 
would be obtained from a national sample using Skylab _i_f the 
accuracy found in this study is representative of an opera- 
tional system. Obviously, improved technology, more frequent 
data collection, use of crop calendars, etc. would change 
this comparison. 

The report text contains information on costs of both 
the current SRS system and the classification costs of pro- 
cessing the Skylab data. While lack of data restrict the 
quality of this analysis, it is clear that the Skylab system 
was not as cost-effective as the SRS system as both operated 
in 1973. 
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This report contains the result of a study entitled 
"Economic Evaluation of Crop Acreage Estimation by Multi- 
spectral Remote Sensing" under a contract between the National 
Aeronautics and Space Administration (NASA) and Michigan 
State University (MSU) * In turn MSU subcontracted with the 
Environmental Research Institute oj\ Michigan (ERIM) for the 
computer analysis of S-192, mul tispectral scanner data* 

The statement of work on the contrafit, NA39-13332 pro- 


vided: 


"The PI shall apply the identified information ex- 
traction techniques to the Skylab and aircraft data 
for crop discrimination, mensuration and analysis 
results for cost effectiveness and accuracy using 
ground truth and the agriculture statistical report- 
ing service as a data base. The efforts are to in- 
clude specifically: 


2.1. 3.1 The PI shall by photointerpretation 
EREP imagery to construct basic land use and 


analyze 
crop maps* 


2 * 1 * 3 • 2 The PI will investigate digital techniques 
to discriminate crops and characteristic signatures 
which indicate crop health and vigor. 


2. 1*3*3 The PI shall differentiate crops within a 
resoultion cell by appropriate digital analysis tech- 
niques* 


2. 1*3*4 The PI shall compare his crop discrimination, 
mensuration and predicted yields with the agriculture 
statistical reporting service for accuracy* 


2*1. 3. 5 The PI shall make a cost effectiveness study 
to determine the feasibility of using remote sensing 
for crop inventories and mensurations." 


Skylab overflights 
12, 1973, August 5, 
August 5, 1973 pass 


obtained information over Michigan on June 
1973 and September 18, 1973* Only the 

occurred in good weather with all sensors 


obtaining usable information. 


Sensors utilized to provide data for this study included 
The S-190A and S-190B cameras as well as the S-192 multispec- 
tral scanner. Aircraft underflights provided additional data. 
Ground truth was compiled from the U.S.D.A. Agricultural Sta- 
bilization and Conservation Service offices, field visits, and 
aerial photography. 

The Statistical Reporting Service of the U.S.D.A. pro- 
vided information on the current system of acreage estimation. 

This report contains analyses of the Skylab data and a 
comparison of the interpreted Sky lab data with ground truth. 
These results are then utilized to compare current acreage 
estimation procedures with potential procedures using Skylab 
technology. 


Skylab Intensive Test Site 

The whole of Michigan T s Lower Peninsula served as a test 
site in the sense that all available Skylab imagery over Mich- 
igan was examined for quality, interpretability , and major 
features of interest; if not directly by personnel on the 
present project, then by other units across the MSU campus to 
whom the imagery was made available according to their special 
interests in the respective areas. However, the major thrust 
of effort with regard to ground truth and analysis of field 
crops in this project was concentrated in a test strip of ap- 
proximately 90 square miles located in eastern Ingham County 
Michigan (Figure 1) . This test strip is centrally located 
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between the MSU and ERIM laboratories, which helped to keep 
travel costs to a minimum for both institutions. The test 
strip runs 18 miles alot.g Dietz Road, extending three miles 
east of Dietz Road and two miles west (see portion of Ingham 
County map in Figure 2). Most of Locke (T4N, R2E) , Leroy 
(T3N, R2E) , and White Oak (T2N, R2E) Townships are included 
in the test strip. The test strip includes the variety of 
crop types and field sizes needed for purposes of the study. 
Figures 3a and 3b are reproductions of RB-57 airphotos cover- 
ing the northern and southern portions of the test strip, 
respectively. 

The test strip is about 99% rural, the town of Webberville 
being an exception. A major interstate highway (1-96) crosses 
the test area just south of Webberville. The area is character- 
ized by intensive agriculture including corn, beans, small 
grains, forage crops, lettuce and cooking onions. In addition, 
the strip contains a few swampy areas and farm woodlots or 
bushy areas which are also present on most sections. It is 
representative of much of mid-Michigan agricultural land. A 
breakdown of the ground truth data for the test area by field 
size is given in Table 1, and by major crop types in Table 2. 
Small farmsteads, fence lines, secondary roads, etc. are ex- 
cluded from the tabulations in Tables 1 and 2. Table 3 com- 
bines the previous Tables to provide data on crop acreages 
and number of fields by field size. 















Figure 3a. Reproduction of RR-57 airphoto covering northern portion of Skylab 
intensive test area. 
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Table 1. Breakdown, of ground truth information for 




Skylab 

intensive test 

area by field 

size • 

Field size 
(acres) 

No. of 
fields 

% of 
fields 

Total 

acres 

% of 
acres 

0 - 

5 

367 

13.67 

1194.8 

2.20 

5+- - 

10 

625 

23.28 

4673.6 

8.59 

10+ - 

20 

811 

30.21 

11950.6 

21.96 

20+ - 

30 

417 

15.53 

10199.9 

18.74 

30+ - 

40 

173 

6.44 

6033.2 

11.09 

40+ - 

50 

96 

3.58 

4239.4 

7.79 

50+ - 

60 

66 

2.46 

3599.9 

6.61 

60+ — 

70 

41 

1.53 

2647.6 

4.86 

70+ - 

80 

33 

1.23 

2434.2 

4.47 

80+ - 

90 

13 

.48 

1108,4 

2.04 

90+ - 

100 

3 

.11 

285.0 

.52 

100+ - 

120 

14 

.52 

1504.5 

2.76 

120+ - 

140 

8 

.30 

1035.6 

1.90 

140+ - 

160 

5 

• i9 

735.5 

1.35 

160+ - 

180 

6 

.22 

1000.9 

1.84 

180+ ~ 

200 

2 

.07 

386.2 

.71 

200+ - 

250 

1 

.04 

241.9 

.44 

2504- - 

300 

2 

.07 

528.3 

97 

300+ 


2 

.07 

630.3 

1.16 

Totals 


2685 

100.00 

54429.8 

100.00 
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Table 2. Breakdown, of ground truth information for Sky lab intensive 
test area by major crop type. 


Crop type 

No . fields 

% fields 

Total acres 

% acres 

Corn 

661 

24.62 

16,508.9 

30.33 

Beans 

137 

5.10 

2,633.2 

4.84 

Forage crops 

656 

24.43 

12,710.7 

23.35 

Truck crops 

6 

.22 

182.0 

.33 

Stubble/ grain 

226 

8.42 

3,765.9 

6.92 

Bare soil 

311 

11.58 

3,963.5 

7.28 

Weeds 

201 

7.49 

4,084.7 

7.51 

Woods/brush 

434 

16.17 

9,891.4 

18.17 

Wetland 

10 

.37 

105.0 

.19 

Other 

43 

1.60 

584.5 

1.08 

Total 

2,685 

100.00 

54,429.8 

100.00 
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Table 3, 

Distribution of major crop types by field 
F = number of fields, A *■ total acres. 

size in the Ingham County intensive test area- 

l 



Field size 
( acres) 

Corn 

Beans 

Forage 

crops 

Truck 

crops 

Stubble/ 

grains 

Bare 

soil 

Weeds 

Woods/ 

Brush 

Wet- 

land 

Other 


0 - 

5 

37 F 
130, 9A 

7F 
23. 7A 

?3F 
263. 5A 

IF 

2.3A 

19F 

67. 6 a 

82F 

267.4A 

28F 

95>5A 

96F 
264. OA 

4F 

12.8A 

20F 
67. 1A 


> - 

10 

U2F 

870,6 a 

30F 
230. 2A 

168F 

1240.6a 

OF 

O.OA 

65F 

468.7A 

86F 
6 47. 2A 

53F 

386.2A 

96F 
703. 7A 

4F 
29. 2A 

11F 
77. 2A 


10+ - 

20 

20 8F 

315*1. 3A 

55F 
816. 1A 

219F 
3166. OA 

OF 

O.OA 

84F 
1273. 8A 

93F 
1324. 3A 

53F 
788. 2A 

94F 

1356. 9A 

IF 

13.3A 

4F 
57. 7A 


20+ - 

3C 

1 39F 
34-50, 6 a 

■:<'F 
704. 5A 

99F 

2419.4A 

2F 
45.0 A 

27F 
655. 7A 

27F 
654. 2A 

29F 
681. 7A 

62F 

1501.4a 

OF 

O.OA 

4F 

0? ,6A 


30+ - 

40 

62F 

2145.6a 

9F 

328. 9A 

30F 
1032. 1A 

IF 

36,8 a 

17F 
592. ?A 

12F 
426. 9A 

15F 

499.6A 

27F 

970.6A 

OF 

O.OA 

OF 

O.OA 


4>+ - 

50 

35F 

1545.2A 

2F 
93. 3A 

27F 
1196. 1A 

IF 
43. 2A 

9F 

383. OA 

2F 
90. 8A 

5F 

216. OA 

12F 
536. 3 A 

1 j? 

49. ?A 

2F 

85.8 a 

1 

H 

O 

50+ - 

6o 

29F 

1576.8 a 

IF 
57. 0A 

11F 
595. 2A 

IF 
54. 7A 

3F 

154.9A 

6f 
331. 1A 

4F 

219. 7A 

10F 

558.6a 

OF 

O.OA 

IF 
5J .9A 


60+ - 

70 

13F 
856. OA 

2F 

129.0A 

10F 
642. 5A 

OF 

O.OA 

IF 
60. 8A 

IF 
65.O A 

5F 

318.6a 

9F 

575. 7A 

OF 

O.OA 

OF 

O.OA 


70+ - 

80 

12F 
899. 7A 

2F 

144. 5A 

5F 
370. 2A 

OF 

O.OA 

OF 

O.OA 

IF 
72. 5A 

4F 

300,6 a 

9F 

646. 7A 

OF 

O.OA 

OF 

O.OA 

C go- 

s n 

h-* i-n 

CT> i- ; 

FI i t+vi 

ip O 

80+ - 

90 

4F 

337.4a 

OF 

0.0A 

4F 
336. 8A 

OF 

O.OA 

IF 
88. 7A 

IF 
84. 1A 

2F 

172.5A 

IF 
88. 9A 

OF 

O.OA 

OF 

O.OA 

P a 

* '• 

90+ - 

100 

IF 
90. 5A 

OF 

0.0A 

OF 

O.OA 

OF 

O.OA 

OF 

O.OA 

OF 

O.OA 

OF 

O.OA 

2F 
194. 5A 

OF 

O.OA 

OF 

C OA 

a r 
fcd 

m ' i! 

100+ - 

160 

6f 

725. 9A 

IF 

106. OA 

?F 

829.1 A 

OF 

O.OA 

OF 

O.OA 

OF 

O.OA 

2F 

245. 2A 

10? 
1202. OA 

OF 

O.OA 

IF 

157.4A 

i_j 0 

T) i-ri 

O J 
O H 
W M 

160+ 


3P 

415,4a 

OF 

O.OA 

3F 
619. 2A 

OF 

O.OA 

OF 

O.OA 

OF 

O.OA 

IF 

160. 9A 

6F 

1282.1 A 

OF 

O.OA 

OF 

O.OA 

£3 
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Ground Truth Information 


Three basic sources of information were utilized in 
assembling Aground truth" data. The first was USDA, ASCS 
field certification records for Ingham County. The second 
was field visitation by technicians employed by MSU for the 
study. The third source of information was photointerpre- 
tation of underflight imagery by technicians employed for the 
project. All of this ground truth information was obtained 
during the 1973 growing season. 

The first set of ground truth information to become 
available was provided through the cooperation of the Agri- 
cultural Stabilization and Conservation Service (USDA, ASCS) . 
During the spring of 1973, this agency conducted an annual 
certification of acreages planted to crops and acreage set- 
aside from production under the Federal wheat and feed grain 
programs. This certification was recorded in the form of 
annotations on enlarged photocopies of black and white air- 
photos. The approximate scale of these photocopies was 1:7, 

920 or 8 Inches to the mile. At this scale, each section was 
covered conveniently by a single page in a loose-leaf notebook. 
It should be noted in passing that ASCS no longer conducts 
this type of certification program. These certification re- 
cords did not constitute a complete ground truth base because 
data was collected only for owners participating in the Federal 
programs. Since the black and white airphotos annotated for 
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certification were several yearB old, they could not be used 
for extracting current crop information in areas not covered 
by certification. Nevertheless, the old airphotos were use- 
ful for developing a field numbering system since field pat- 
tern is relatively stable over time. The field numbering 
system was structured as follows: 

Township identifier - 1 digit 
Section number - 2 digits 
Field number within section - 3 digits 
Subdivision of field — 1 decimal digit 
The field numbers were recorded directly on the ASCS photo- 
copies and kept in a loose-leaf notebook. 

During August of 1973 a program of field visitation by 
the project technicians was undertaken to fill gaps in the 
certification records. Most fields within the test area that 
were accessible from roads or without violation of trespass 
laws were visited. For each field the crop species or domi- 
nant natural vegetation was recorded and any unusual conditions 
noted. Extensive use of ground-level photography was made dur- 
ing these field visits. 

The combination of ASCS certification records and on- 
site observation of accessible fields still left gaps in the 
interior of sections. Acreage figures were available for 
fields covered in the ASCS records, but actual acreage measure- 
ment on the ground for other fields during field visitation 
would have been prohibitively time consuming. Thus, another 

j 
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source of information was needed for identifying crops in the 
interior of sections and for acreage measurement. Photointer- 
pretive analysis of underflight imagery flown with the NASA 
U-2 and ERIM C-47 aircraft was used to obtain this information. 
The U-2 Skylab support flight took place on August 11, 1973 
and provided complete stereo coverage of the test area at a 
nominal scale of 1*130,000 on CIS. film'. The C— 47 imagery was 
collected during a simultaneous underflight of the August 5, 

19/3 Skylab pass. The C-47 sensor complement included several 
cameras loaded with different film types and the ERIM multi- 
spectral scanner. Imagery was collected at several flying 
heights. Only a portion of the test area was included in the 
C-47 mission since its primary purpose was to provide informa- 
tion on atmospheric and ground conditions for use in computer 
analysis of S-192 scanner data. 

The C— 47 imagery became available for use in the vault 
at ERIM laboratories before the U-2 imagery was received. There- 
fore, 70mm CIR imagery from this mission was used for crop iden- 
tification to fill gaps in ground truth for the limited area 
covered. Crop identification for the remainder of the area 
was completed later upon receipt of the U-2 imagery. Acreages 
were scaled either from the ASCS black and white photocopies 
or frcm the U-2 imagery. Several methods of scaling acreages 
were used depending on the preference of the interpreter. Meth- 
ods of area measurement included transparent dot grid, plani- 
meter, and ocular grid on the Bausch and Lomb Zoom 240 stereo- 


scope. Rolatape measurements were taken in the field to de- 
termine precisely the scale of airphotos used for acreage 
measurement . 

Ground truth information from these various sources was 
assembled first in notebook form and subsequently on punched 
cards. As the analysis proceeded, it also became evident that 
time woul. be saved by having the ground truth in map form. 
Accordingly, 5X enlargements were made of the U-2 photos cover- 
ing the test area, and maps were prepared in the form of over- 
lays on acetate. 

Generalized breakdowns of the test area by field size and 
major crop types were given previously in Tables 1, 2, and 3. A 
more detailed breakdown of the ground truth information by town- 
ship is contained in Tables 4,5, 6 , and 7, The percentage of 
fields belonging to each ground cover class does not differ sig- 
nificantly between townships. However, the percentage of the 
total acreage is significantly different for corn and grass. 

Corn covers 35.8 percent of Leroy Township but only 26.0 percent 
of Locke while grass ranges from 21,1 percent in Leroy to 31.9 
percent in Locke Township. The major ground cover classes, in 
order of decreasing importance according to the percent found 
in the test site, are listed below: 


Corn 

30.3% 

Grass 

25.5% 

Woods 

16.8% 

Stubble 

9.4% 

Bare Soil 

7.2% 
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x'able 4. Ground truth for Locke Township, Ingham County, Michigan. Given in 

acres and numbers of fields. 
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Table 5 


Ground truth for Leroy Township, Ingham County, Michigan. Given in 

acres and numbers of fields. ,, 
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Table 7. Percentage of totals of acreages and number of fields for various 
ground cover' classes for each of the three townships and for the 
entire test site. 
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All other ground covers represent less than 5 percent of the 
total acreage of the area. This analysis is confounded by 
ambiguity concerning the definition of a field. A farmer 
usually refers to a unit of land as a field when it is all in- 
cluded within one fenced area with no natural barriers to cul- 
tivating the entire area in continous passes. For purposes of 
the detailed analysis above we defined fields with reference 
to the traditional definition plus requiring a single cover 
crop. Thus, if a field contained 50 percent of one crop and 
50 percent of another, it was treated as if it were two separ- 
ate fields. If dually listed as a woods-pasture or weeds and 
brush, it was placed under the category first mentioned. How- 
ever, a weedy field crop was labeled by the crop, e.g. weedy 
soybeans were called soybeans. Since fields with dual crop 
identification were arbitrarily classified by the first desig- 
nation, there may be a slight bias in the results. This bias 
is likely to be important only for the grass and trees cate- 
gories where the dual listing occured most often. 

Supplemental ground truth information was also collected 
in Eaton County, Michigan in an area previously used for ERTS 
studies. This area was situated along an extension of the 
Skylab flight path, and it was felt that additional data on 
wheat might be garnered for the photointerpretive studies 
of Skylab imagery as well as for possible use in subresolution 
element analysis. The Eaton County information was collected 
in June, 1973 on a 4 by 5 mile strip having McConnell Road as 


the southern boundary and Cochran Road as the north-south 
bisector. Information on fields in this area came from field 
visits and photointerpretive analysis of aircraft imagery. 
Additional information on all wheat and plowed fields was al- 
so obtained for a 5 by 2 mile strip immediately to the north 
of this area. Unfortunately, little Skylab imagery was ob- 
tained for this area while the bare soil and growing wheat 
conditions prevailed. 

Collection of ground truth information occupied most of 
the 1973 field season, and acreage determinations extended in- 
to the summer of 1974. However, the effort was required be- 
cause the ground truth information served as the basis for 
both the photointerpretive analysis of Skylab Imagery at MSU 
and the computer analysis of S-192 scanner data at ERIM. 

Description of Photographic Imagery 

This section contains a description of photographic imagery 
collected in support of the project by both Skylab and aircraft 
cameras. Aircraft imagery is treated first, then Skylab imagery 

The first photographic mission was flown with the ERIM 
C-47 aircraft on August 5, 1973. This flight took place simul- 
taneously with a Skylab pass over the Ingham County test area, 
and was intended primarily to provide supporting data for com- 
puter analysis of S-192 MSS data. However, it was also used 
to fill gaps in ground truth. The sensor complement for this 
mission included three cameras and the ERIM multispectural 
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scanner. The P22C camera (70mm format) was loaded With CIR 
film. The KB -8 cwvH4 (70mm format) was loaded with conven- 
tional Cblcr film- -he K-19 camera (9 inch format) Was loaded 
with black and white film. The latter camera is used primarily 

...... - ' W - 

tb prow id* v a n-ivi/iai: ional record of the mission. This mission 
included passes at several altitudes over the test area, but 
included coverage only for the central portion of the test 
strip . 

The second support mission was flown with a NASA U-2 
aircraft on August 11, 1973. The photographic product of this 
mission was 9-inch C IK Imagery at a nominal scale of 1:130,000. 
Full stereo coverage in five flight lines of Eaton, Ingham, 
Livingston, Oakland, and Macomb Counties was included, as well 
as partial coverage of Kalamazoo, Calhoun, Jackson, Washtenaw, 
Wayne, Kent, Clinton, Shiawassee, Genessee, Lapeer, and St. 

Clair counties. The 9-inch transparencies from this mission 
served as the primary reference for photointerpretive studies 
in this project. 

Aircraft imagery flown in previous years to support other 
projects such as ERTS and land use studies was also available 
for use in the Sky lab work. Details of this Imagery Will not 
be discussed here except to note that the various dates of 
photography, scales, film types, and formats collectively 
covered most of southern Michigan. 


The Skylab satellite made data collection passes over 
southern lower Michigan on three dates during 1973. Because 
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of cloud cover on September 18 only the first two provided 
ueeble Imagery for the Ingham County test area. Table 8 
contains a summary of imagery by date and sensor. 


Table 8. Summary of Skylab photographic coverage for southern 



lower Michigan. 





Date 


Mission 

Sensor 

Rolls 


June 12, 

1973 

SL-2 

S-190A 

13 

to 

18 

August 5, 

1973 

SI.- 3 

S-190A 

19 

to 

24 

August 5, 

1973 

SL-3 

S-190B 


83 


Sept. 18, 

1973 

SL-3 

S-190A* 

43 

to 

48 

Sept. 18, 

1973 

SL-3 

S-190B* 


88 


* 

Ingham County 

test area covered 

by clouds. 





Skylab 2 passed over the Ingham County test area on June 
12, 1973 (Figure 4). Because encroaching clouds indicated un- 
certain weather conditions for this pass, only the S-190A sen- 
sor system was operated. Despite the occurrence of widespread 
clouds over southern Michigan on that date, the Ingham County 
test area happened to be clear at the moment of coverage and 
usable imagery was obtained. The S-190A system consisted of 
six cameras with 70 mm format and 6-lnch focal length. These 
cameras were loaded with different film/filter combinations 
to give multiband imagery. Film types Included panatomic-X 
aerial black and white, infrared aerographlc black and white, 
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inf rared aerochrome color, and high resolution aerial color. 

u 

The combination of 6-inch focal length and 270 mile orbital 
altitude produced Imagery with a scale of 1:3,000,000. 

The prime Skylab pass for this project took place on 
August 5, 1973 during the SL-3 mission. Weather condltons were 
excellent and both the S-190A and S-190B camera systems were 
operative. The S-190A coverage is shown dlsgrammatlcally in 
Figure 5, and the S-190B coverage is shown in Figure 6. The 
S-190B camera uad an 18-inch focal length (scale 1:1,000,000) 
and was loaded with high resoultion aerial color film (S0-242). 
This pass was accompanied by a simultaneous C-47 underflight, 
and followed the next week by a U-2 support mission. Both of 
these aircraft missions were described at the beginning of this 
j section. 

On September 18, 1973 SL-3 again passed over southern 
Michigan with bot’i the S-190A and S-190B camera systems in op- 
eration. Coverage for S-190A on this pass is shown in Figure 7, 
and for S-190B in Figure 8. On this pass, however, only Saginaw 
and Huron Counties irere free of clouds. As a consequence, 
imagery over the Ingham County test area was not usable. 


Description of Multispectral Scanner Data 
On August 5, 1973 the Skylab multispectral scanner, S-192 
was operated over the Michigan test site at approximately 10:02 
EST or 15:02 GMT. Atmospheric conditions were variable and hazy. 
This date provided the only usable S-192 data because of poor 
weather conditions at the time of the scheduled June and Septem- 
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one factor that must be considered when widespread application 
of Skylab-type imagery is contemplated. Therefore, some discus- 
sion regarding a minimal configuration of interpretive equipment 
is in order. Given the small scale and orbital altitudes, stereo 
viewing adds little to the interpretive process and is scarcely 
worth the time required to orient the stereopair properly. In 
practice, the Bausch & Lomb unit was usually used in the binocular 
macroscope mode during routine interpretation. Likewise, the 
adjustable intensity of the light table and the slow motion scan 
are more in the category of conveniences than necessities. Ex- 
periments with interpretation of projected images on rear projec- 
tion screens showed this approach to be much less satisfactory 
than direct viewing of the transparencies with transmitted light. 
Thus, a minimal set of equipment for effective interpretation 
would be a zoom macroscope with oculars used over a portable 
light table. It might also be noted that the tendency of the 
transparencies to curl necessitates the use of a glass hold-down 
plate or a cardboard frame to keep the imagery flat. 

The usual clues which the photointerpreter uses to make 
identifications include tone, texture, pattern, size, shape, 
shadow, location, and association of features. As scales be- 
come smaller, with consequent loss of resolution, the geometric 
clues such as size, shape, and shadow fade until they are lost 
except for their contribution to tone, texture, and pattern. 
Likewise, textures become more uniform, and therefore less use- 
ful, at smaller scales. By any criteria one might choose, the 


the S-190A and S-190B contact scales of approximately 1:3*000,000 
and 1:1*000*000* respectively, must be classed as very small scale. 
Thus, the Interpreter is forced to rely primarily on tonal or 
color signature in identifying cover types, with some aid from 
texture in the case of forested areas and pattern in certain 
other instances. Very small scales also make annotation and 
mapping on overlays extremely difficult. 

The importance of resolution cannot be overemphasized. Al- 
though small scales can be enlarged either optically or photo- 
graphically* little is to be gained by enlargement if no further 
ability to discriminate between objects is obtained. ERTS trans- 
parencies and S-190B photos* for example* are both distributed 
to users at a scale of approximately 1:1,000*000. ERTS can be 
profitably enlarged by a factor of five to produce a working 
scale of about 1:200*000. At this enlarged scale, main roads 
can usually be distinguished and sometimes secondary roads, but 
field boundaries are indistinct. In contrast, S-190B imagery 
will stand at least 15X enlargement by photographic means and 
more than 20X by optical means. Thus, working scales ranging 
from 1:50,000 to 1:100,000 can be readily obtained. Individual 
fields are easily delineated on such enlargements. This ability 
to delineate and measure acreage on individual fields effectively 
increases the degrees of freedom for statistical estimates. For 
this reason alone the added resolution is worthwhile, even if 
lack of variation in tone between cover types limits the ability 
to classify crop species. Glassification accuracy can be improved 
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by temporal overlays keyed to a crop calendar, but accuracy of 
acreage measurement cannot. 

Interpreters on this project faced several variables such 
as differences in scale and resolution between S-190A and S-190B 
imagery, differences in film types, and in crop phenology. In 
view of this, the first step in assessing a set of imagery was 
always a first-look examination for purposes of orientation and 
general determination of image quality, cloud cover, etc. The 
second step was to formulate a set of categories that were con- 
sistently separable and to develop a key to the tonal signatures 
of the categories. The procedure for doing this was first to 
select a training set of large and distinct fields on the basis 
of ground truth information. These fields were then located on 
the imagery and their tonal signatures noted. The signatures 
for these training fields were then compared across cover types 
to arrive at a set of categories for which the probabiltiy of 
discrimination was relatively good. The third step was to run 
a test of these tentative categories to see if they could be 
recognized consistently on the imagery. In this phase, large 
and distinct fields were selected from the Skylab imagery, in- 
terpreted, and compared with the corresponding ground truth 
data. This phase might result either in acceptance of the 
tentative categories, or in some refinement of the categories. 
Next came the operational phase of interpretation in which a 
block of 3000 - 5000 acres was selected. The cover types in 
this block were classified according to the predetermined cate- 
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gories and acreages measured with the ocular grid. Cover clas- 
sification and acreage were recorded on a f leld-by-f ield basis 
for S-190B imagery, but this was not possible for S-190A Imagery 
because of the lack of resolution. The only reference to ground 
truth during operational Interpretation was for tonal signatures 
that had not been previously encountered, and for field numbers 
in the case of S-190B imagery. General notes regarding time in- 
volved and any particular problems encountered were maintained 
during the interpretation of this large block. Time study data, 
per se, was not collected during this phase. Results of inter- 
pretation for the block served as the basis for analysis of 
accuracy and assessing utility of the imagery for agricultural 
surveys. A separate time study for interpretation of S-190B 
imagery was conducted later. 

Most of the interpretation was performed by a technician 
with a Bachelors degree in forestry and a background in agricul- 
ture and use of alrphotos. It was felt that a Bachelors degree 
is some agriculturally oriented field and prior experience with 
alrphotos might be typical of interpreters employed for opera- 
tional work in agricultural surveys, whereas a Ph.D. with exper- 
ience in remote sensing research would be atypical. 

Analysis of S-190A Imagery 

The S-190A sensor system is a multiband photographic camera 
equipped to provide imagery in six spectral regions. The spec- 
ifications of the six high precision lenses are: 6-inch focal 

length with matched distortion and focal length: f/2.8; 21.2° 


FOV. The shutter assemblies are a rotary-intralens type pro- 
viding for variable aperture settings from f/2,8 to f/16 in H - 
stop increments and shutter speeds of 2.5, 5, and 10 milliseconds 
with 0.4 millisecond synchronization. Intervalometer settings 
are adjustable from 2 to 20 seconds in 2 second increments. 
Imagery is provided in 70mm format. The combination of orbital 
altitude and field of view gives ground coverage over a square 
approximately 100 statute miles on a side in each frame , with 
contact scale of 1:3,000,000. 

Four of the six camera stations were loaded with black & 
white film/filter combinations to cover four adjacent spectral 
bands of 0.1 micrometer each as follows: 

Camera 

Bandwidth (micrometers) Film Filter position 


0.5 to 

0.6 

Panatomlc-X aerial 
B&W , type S0-022 

AA 

6 

0.6 to 

0.7 

Same as above 

BB 

5 

0.7 to 

0.8 

IR aerographic B&W, 
type EK 2424 

CC 

1 

0.8 to 

0.9 

Same as above 

DD 

2 


Camera position (station) 3 carried aerochrome IR color 
film (type S0-127) with EE filter to give a CIR image in the 
spectral range 0.5 to 0.88 micrometers. Camera position 4 
carried high-resolution aerial color film (type SO-356) with 
FF filter to give a conventional color image in the spectral 
range 0.4 to 0.7 micrometers. Other filter combinations were 


available for testing, but were not used for the present study. 

The first S-190A imagery over the Ingham County test area 
was acquired on June 12, 1973 during the SL-2 mission. Although 
much of the region was clouded on that date, the test area hap- 
pened to be free of clouds. Phenologlcal conditions in southern 
Michigan on June 12, 1973 were as follows: 

Forests: Most species in full leaf; 

Grasses: Green and from several inches to over a foot 

in height, although some areas still showed 
dead material from the previous season over 
the current year's green growth; 

Small grains: Heading, mostly green; 

Row crops: Many fields tilled, some of which were planted, 

but few of which had any emergent green growth 
above 3 inches in height. Some fields not yet 
tilled. 

This set of phenologlcal conditions is relatively favorable 
for interpretation of forests and related natural vegetation. 
Conditions also favor interpretation of grasses as a group, but 
separation of species is difficult because many appear similar 
in early and middle stages of development. Likewise, separation 
of small grains from grasses would be expected to be difficult. 
Fields destined for cultivation of row crops would appear as 
bare soil if tilled, or possibly grass or small grain tilled. 
Therefore, different species of row crops are not detectable at 
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this date. 




As expected, the first-look analysis indicated that the 
general detectability of features varied by film type. On the 
two panatomlc-X films covering the spectral range 0.5 to 0.7 
micrometers, main roads shoved well and boundaries between 
contrasting types were relatively distinct. Water bodies, how- 
ever, were difficult to see. The B&W IR bands covering the 
spectral range of 0.7 to 0.9 micrometers were complementary 
to the previous bands since water bodies were easily distinguish- 
able while roads and type boundaries were indistinct. The char- 
acteristics of the conventional color image were similar to the 
black and white images covering the same spectral range, except 
that the color tones produced more variability in signatures be- 
tween cover types. 

There was considerable variation in quality, however, be- 
tween the duplicate bands of this film. Two of the duplicate 
bands were quite dark and difficult to interpret. The lighter 
duplicates were much easier to interpret . The broad band CIR 
imagery was best with respect to variety of features registered. 
On this latter type of imagery main roads showed fairly well, 
water bodies registered clearly, and boundaries between contrast- 
ing types were relatively distinct. However, resolution was 
generally poorer on the CIR imagery than for the conventional 
color. As with the conventional color, quality of reproduction 
was quite variable for CIR. 

In general, this S-190A imagery was satisfactory for in- 
terpreting gross characteristics on a regional scale. However, 


the resolution was not sharp enough to allow consistant inter- 
pretation or measurement on a field-by-field basis. Field 
boundaries could not be delineated unless a field was surrounded 
by other fields that would produce a sharp contrast on the film. 
Under ideal conditions, such as with bare soil in adjacent fields, 
a field as small as one acre could be delineated; but this was 
strictly the exception rather than the rule* A general idea of 
location was possible because the major roads and freeways could 
be seen. However, most section-mile roads could not be seen, 
and one could not tell where the different sections began or 
ended. This lack of resolution created a problem in locating 
specific areas on the imagery for quantitative tests. 

Since first-look Interpretive efforts substantiated the 
general separability to be expected from phenological conditions 
on June 12th, only three categories were used: 

1) bare soil 

2) forest/brush 

3) Grass, forage crops, small grains, etc. 

Signatures on the conventional color film were brown for soil, 
green for crops/grasses, and a darker green for forests. Color 
categories on the CIR film were white for soil, red for vegeta- 
tion, and black for wet areas. Different shades of red were not 
detectable for the different crops. Futhermore, crops and forests 
looked essentially the same shade of red on the CIR except that 
lowland hardwoods had a tinge of black. There was some confusion 
between forests and wetlands on the CIR as will be explained later. 


The difficulty in locating specific areas on the June 
12th S-190A imagery has already been mentioned. Such location 
was essential* however* in order to conduct quantitative tests 
of interpretive accuracy. The difficulty was resolved by using 
distinctively shaped woodlots situated at the corners of an 
8-section rectangle in Leroy Township to establish a block of 
area which could be compared with ground truth. Cover types 
were identified in the three categories and acreages measured 
from both the color and CIR S-190A imagery. The results of 
these tests are summarized in Table 9. 

In all cases except the forest category as Interpreted 
from CIR, there was a consistent underestimation of approxi- 
mately 25 percent. Upon further Investigation of the anomaly 
involving forests on CIR* it was discovered that the techni- 
cian was actually using a signature produced by seasonally 
wet areas instead of forests. The confusion arose because 
many of the forests in the area are lowland hardwoods. On 
the whole* the color film was judged to be more easily inter- 
pretable for most categories than CIR due to somewhat better 
resolution in the color film. The reason for the consistent 
bias toward underestimation was not fully determined* but it 
is suspected to have arisen from the difficulty in delineating 
type boundaries. Given this consistent bias, however* the use 
of a ratio correction factor seems appropriate. If the ratio 
correction of 1.25 is applied to the interpretations from the 
color film* the results are as shown in Table 10. Therefore* 
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these tests of three categories as Interpreted from the June 
12 , 1973 S-190A color film Indicate approximate occur cies of 
75 percent before ratio correction and over 90 percent after 


ratio correction. In practice, the development and use of 
a ratio estimator of this type would imply some sort of double 
sampling ay stem for developing the correction factor. The 
second phase of the sampling could be based on aircraft imagery. 


ground imagery, or some combination of the two . Sue to the 
inabiltly to distinguish Individual fields Or even sections 
consistently, standard errors are not available to support 


the results presented in Tables 9 and 10. 

A second set of S-190A Imagery over the Ingham County 
test area resulted from a Shy lab pass on August 5, 1973. 
Weather conditions over most of southern Michigan on this 
date were good, and general phonological conditions were as 
follows: 

forests: all species in full leaf with closed canopy; 

grasses: most species starting to ssneace; 

Small grains: mature, senescent - with some already 


\ 

I 




harvested; 

Row crops: in late stages of growth with nearly complete 

ground cover, and part of corn in tassle. 

The results of the first-look analysis for the August 
5th S-19 QA imagery were similar to those for the June 12th 


imagery with respect to detectability of features on the 


w 


several types of film. Variability In either exposure or 
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processlng for the S-190A imagery was again evident. In 
this case, the conventional color fil* was very dark. Bare 
aoll could be distinguished fairly readily even on the dark- 
ened iaasery , but the ability to make other distinctions such 
aa between forested areas and crops was serious ly impaired, 
larger roads Were recognizable , but with difficulty . Because 
of this poor quality, the color images fro* the August 5th 
pass were not used in quantitative teats. The OIR film for 
the August 5th pads was of better quality than that for June 
12th. Howevbr, there was again a rapid decay in clarity with 
magnification and location was difficult because of Inability 
to distinguish sec t i on -mi 1 e roads* As with the June 12th 
inagery , Individual fields could not be distinguished consis- 
tently which meant that comparisons with ground truth data 
were limited to a large-area basis. 

There were three distinct shades of red on the August 
5th CIR imagery: dark red, a dull red, and a bright red. The 
dark red was a hardwood forest signature. It was initially 
thought that the bright red would correspond with a specific 
crop. However, this did not prove to be the case. For example, 
this shade of red represented such diverse cover types as al- 
falfa field, grassy area, and weed-filled soybean field. The 
dull shade of fed also included a variety of crop types. There- 
fore , the bright and dull shades of red were pooled into a 
category that included crops , greases, end miscellaneous types* 

■ ' Acreage estimation was again done on the same block of 
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eight sections in Leroy Township used for the tests with June 
12th imagery. In this case, it was felt that the shifts in 
phenologlcal conditions and color signatures between June 12th 
and August 5th were sufficiently large that the tests would 
be essentially independent despite use of the same area. The 
same three categories were used as for the June 12th Imagery. 
The results of this test are presented in Table 11. It should 
be noted that the total acreages from ground truth are slightly 
different in Table 9 and Table 11, even though the same area 
is used for both tests. In the case of the tests on June 12th 
imagery presented in Table 9, the early phenologlcal condi- 
tions and lack of an underflight early in the growing season 
made it necessary to rely on the ASCS certification records 
in conjunction with a June field visit for ground truth. For 
the August data presented in Table 7, however, the U-2 under- 


Table 11. Results of photointerpreting S-190A imagery taken 


on August 5, 1973 for eight sections in 
ship, Ingham County, Michigan. 

Leroy Town- 

Category 

Acres from S-190A 
CIR imagery 

Acres from 
ground truth 

percent 

accuracy 

Bare soil 

432 

557.5 

77% 

Tores t 

"955 

889.7 

93% 

* * 

Grass , forage , 
crops, etc. 

3028 

3219.2 

94% ’■ 

Overall 

4415 

4666.4 

95% 
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flight from August 11, 1973 was used for scaling acreages and 
was supplemented as needed by the August field visit. The diff- 
erences in source of "ground truth" account for the discrepan- 
cies in total acreage. Since the chances for uncontrolled re- 
cording errors and variation in scale of enlarged photocopies 
are greater, the ASCS certification records are less reliable 
than the U-2 underflight. 

Overall accuracies for the forest and generalized crop 
categories in the August 5th test are 93 percent and 94 per- 
cent, respectively. Ratio correction was not necessary in this 
case. A probable explanation for the lack of negative bias 
in these categories for the August 5th test comes from the 
interpreter's prior experience with S-190B imagery. It is 
suspected that the knowledge of negative bias in the first 
effort made him more conscious of the need for interpolation 
in drawing boundaries between adjacent types in the second 
test. This variability of the interpreter's performance with 
experience again underscores the need for a double-sampling 
approach if low resolution imagery such as that from the 
S-190A is to be used in practical crop surveys. The final 
point relates to the underestimation of the bare soil cate- 
gory in the August 5th test giving an accuracy of only 77 
percent. Interpreter bias is not a likely explanation In 
this case since the bare soil signature was quite distinctive. 
The amount of bare soil present on August 5th was relatively 
fjffiall and plowing of two or three large fields (in preparation 




slow process because of difficulty In locating specific points 
in reference to maps or aircraft imagery. Separation of crop 
types could not be accomplished using imagery from a single 


date. Inability to distinguish individual fields would make 
development of temporal overlays of imagery from multiple 
dates difficult. 


Analysis of S-190B Imagery 

In addition to the S-190A multiband camera system, the 
Skylab EREP package also included the S-190B earth terrain 
camera. Salient features of the S-190B camera are f/4 lens 
with focal length of 18 inches, intervalometer settings from 
0 to 25 frames per minute, shutter speeds of 5.7 and 10 milli- 
seconds, and compensation for forward motion through program- 
med camera rotation from 0 to 25 milliradians/second. Film 
format is 5 inches with a 4.5 inch square image. This for- 
mat with the 270 statute mile orbital altitude gives ground 
coverage of a square approximately 68 statute miles on a 
side at a contact sc:<le of approximately 1:1,000,000. For 
purposes of the present project, the S-190B camera was loaded 
with SO-242 high-resolution aerial color film sensitive to 
wavelengths in the .4 - .7 micrometer region of the spectrum. 

The S-190B camera system was operated over the Ingham 
Count/ test area on August 5, 1973 and September 18, 1973. 
Since the test area was covered by clouds on September 18, 
analyses could be performed only on the August 5th imagery. 

A comparison of the S-190A and the S-190B imagery clearly 



shows that substantially better resolution was obtained with 
the S-190B system. Section-mile roads and most field bound- 
aries are readily visible on the S-190B imagery when viewed 
under the magnification afforded by the Bauseh & Lomb Zoom 
240 equipment. Thus, measurement and comparison with ground 
truth can be performed on a f ield-by-f ield basis, including 
calculation of standard errors. 

Phenological conditions existing on August 5, 1973 for 
the southern Michigan area have already been described in 
connection with analysis of S-190A imagery and will not be 
repeated here. Sections 2 - 8 of Locke Township in the Ing- 
ham Gounty test area were used for the quantitative tests of 
photointerpre tat ion with the S-19QB imagery. 

Two aspects of photointerpretive analysis for crop 
acreage assessment must be considered. The first is accuracy 
of acreage measurement* Area measurements from the Skylab 
imagery were performed under 15X magnification with an ocular 
grid on which the lines are spaced 0.25 mm apart. Given good 
equipment such as this, accuracy of acreage measurement is 
primarily a function of resolution and field size. The re- 
sults of the quantitative tests on S-190B imagery with respect 
to accuracy of acreage measurement are summarized in Table 12. 
The figures contained in the columns of Table 12 labeled 
"no. of fields", "total acreage from ground truth", and "total 
acreage from S-190B" are self-evident. "Aggregate % error" 


is calculated as: 



Agg. % error = 


Total acr* from S190B - Total acr. from ground truth 


Total acreage from ground truth 
"Average signed error (acres)" is calculated as: 


Avr* signed error (A) = 


Z measured field size - actual field size 


w number of fields 

"Average unsigned error (acres)" is calculated as: 

* , , , . v % I measured field size - actual field size 


Avr. unsigned error (A) = ■■ 1 

"Average % error" is calculated as: 

o t measured fie 
100 x Z 


number of fields 


Avr* % error = 


measured field size - actual field size i 

actual field size 

number of fields 


For calculation of "Std. Dev. of error (A)", the formula is: 
_ s X 2 - a X) 2 /n 


Std. Dev. - 


n - 1 


vrhere 


X * measured field size - actual fid. size 


n = number of fields 

The same formula applies for Std. Dev. of % error except that 
100 x (measured field size - actual field size) 



A — 


actual 

field size 




There 

is a slight 

tendency 

toward unde 

restimation as re- 

flee 

ted in 

the overall 

aggregate 

percentage 

error 

of -6%. 

Only 

in the 

ease of fie 

Ids less 

than ten ac 

res wa 

s there a 

tend 

ency to 

over es t j ma t 

e acreage 

from the S 

-190 B 

imagery . 

This 

small 

tendency tow 

a rd under es tima t ion 

can be 

attributed 

to 1 

imlts o 

f resolution 

along th 

e field boundary, 

and should 

not 

cons tit 

ute a limita 

tion for 

use of the 

imager 

y in acre- 

age 

assessment since a 

correct!© 

n for bias 

could 

be obtained 


Table 12. Results of testa on accuracy of acreage measurement from S-190B Imagery taken on August 5, 1973 
based on analysis of 7 sections In Locke Township, Ingham County, Michigan, 


Field size 

Mo, of 

Total A* 

Total A. 

Agg. % 

Avr signed 

Avr unslgn 

Avr % 

Std. D?v. 

Std. Dev 

(acres) 

fields 

Gr, Truth 

S-190B 

error 

error (a) 

error (A) 

error 

error (A) 

% error 

0-10 

**5 

295.1 

300.0 

1.66 

.11 

1.6 

28.98 

2.13 

26.73 

1CH- - 20 

64 

962.0 

952.5 

-.99 

-.15 

2.9 

19.50 

3.79 

15.27 

20* - 30 

30 

721.1 

690.4 

-4.26 

-1.02 

2.6 

10.74 

3.42 

9.44 

30* - 40 

16 

549.7 

460.0 

-16.32 

-5.61 

5.8 

16.74 

5.10 

13.86 

40* - 50 

5 

222.0 

200.3 

-9.77 

-4.34 

6.0 

13.26 

7.77 

13.11 

50+ - 60 

2 

101.9 

69.6 

-31.70 

— 16.15 

16.2 

31.63 

5.02 

9-37 

60* - 70 

1 

60.7 

60.5 

-.33 

-.20 

.2 

.33 

— - 

— 

70* - 80 
80f - 90 
90* - 100 

5 

367.3 

331.1 

-9.86 

-7.24 

7.2 

9.94 

4.62 

6.68 

1 

91.9 

75.6 

-17.74 

-16.30 

16.3 

17.74 

— 

— 

100* 

1 

209.2 

196.6 

-6.02 

-12.60 

12.6 

6. GO 

— 

— 

Overall 

170 

3580.9 

3336.6 

-6.82 

-1.44 

3.28 

19.68 

4.7 

18.92 
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through a relatively small sample of fields. Although the 
smaller base for the percentage gives an inflated percentage 
error for individual fields less than 20 acres, there is no 
noticeable decline in ability to measure actual acreage as 
fields become smaller within the usual range of field sizes. 
Since a rather small percentage of the total crop is produced 
in fields less than 5 acres, the resolution obtained with 
S-190B is judged to be adequate for purposes of crop acreage 
assessment for major field crops. Further improvements in 
resolution would, however, be useful when working with minor 
crops such as vegetables which are often grown on small plots 

Besides measuring acreage of fields, one must be able 
to identify crop type from the imagery in order to do crop 
acreage assessment. The S-190B imagery has less utility in 
this regard than for acreage measurement per se. 

The self-training procedure for the interpreter, as 
described earlier, involved the following steps: 

1) Selection of training fields from ground truth; 

2) Location of training fields on the imagery and 
description of their color signatures ; 

3) Correlation of signature with crop type and 
development of tentative categories for interpre- 
tation ; 

4) A test from imagery to ground truth to verify 
the suitability of categories. 

The results of this training procedure as applied to the 


August 5th S-190B imagery showed that bare soli had a distinc- 
tive signature that was whitish in tone. Forested areas were 
easily recognized by virtue of a very dark green color and 
rough texture. Senescent vegetation such as uncut oats and 
wheat had a light brown color which could be quite readily 
detected. The stubble of cut hay and small grains had a 
large contribution of soil to its signature along with a 
sparse cover of vegetation which combined to give a very 
light green tone. Mature corn which was well- tasselled gave 
a brownish-green signature. However , no other categories 
were consistently separable since all other crops along with 
corn which had not yet tasselled had a medium green tone. 

Classification results for the operational test over 7 
sections in Locke Township are presented in Table 13. In 
this table, fields are grouped into five categories according 
to ground truth information. The combined "stubble and senes- 
cent vegetation" category includes all small grains along with 
some recently cut fields of hay. All crops with the exception 
of small grains, stubble fields, and corn are included in the 
"other" category along with such miscellaneous types as non- 
forested wetlands and farmsteads. 

Sixteen of the 18 bare soil fields were correctly clas- 
sified for an accuracy of 89 percent. The other two bare soil 
fields were misclassif led as stubble, probably due to the in- 
vasion of sparse weeds which darkened the signature. 

The fact that 15 of the 45 corn fields were not recog- 



Table 13 .Classification results for August 5» 1973 S-190B Imagery based on Interpretation of 7 sections 
In Locke Township, Ingham County, Michigan. 


Category from 
ground truth 

No. of 
fields 

Total 

acres 

Correctly classed 
# fields * fields 

Soil 

_No._of fields_wrongly_classified 
Corn Br7*oods Stub/senesc 

as_-- _ 
Other 

Bare soil 

18 

252.3 

16 

89* 

0 

0 

0 

2 

0 

Corn 

45 

1100.1 

25 

56* 

0 

0 

0 

0 

20 

Brush/woods 

17 

399.4 

17 

100* 

0 

0 

0 

0 

0 

Stubble/senescent 

20 

345.1 

12 

60* 

0 

3 

0 

0 

5 

Other 

70 

1514.0 

63 

90* 

1 

2 

0 

4 

0 

Overall 

i?0 

3580.9 

433 

78* 

1 

5 

0 

6 

25 


! 
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nized as corn is due to the presence of dual signatures for 
this crop as mentioned earlier. The tasselled corn was clas- 
sified correctly due to the distinctive brownish-green signa- 
ture, while the fields of corn which had not yet tasselled 
were classified in the "other" category. 

Twelve of the twenty "stubble and senescent vegetation" 
(uncut small grain being the main type of senescent vegeta- 
tion at that date) fields were classified correctly for an 
accuracy of only 60 percent. Lack of accuracy in this cate- 
gory is not too surprising due to the variable nature of the 
targets. Grass and low weeds below the level of the cutter 
bar may contribute a green cast to the signature, which ac- 
counts for the five fields misclassif led as belonging to the 
"other" category. Also, the presence of wheat or oat straw 
on such low weeds can give a cast similar to that of tassel- 
led corn, which accounts for the three fields classified as 
belonging to the "corn" category. 

There are a variety of possible reasons for errors of 
classification in the "other" category which will not be 
discussed here. 

Difficulty in discriminating crop types on the basis 
of color signature alone from conventional color imagery is 
not surprising. Several support missions were flown for 
ERTS investigations at MSU with the RB-57 high-altitude air- 
craft. These missions Included simultaneous coverage with 
both conventional color and C1R Imagery at a scale of ap- 
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proxlmately 1:120,000. The signatures of cover types on the 
conventional color Imagery were quite washed-out and exhibited 
little contrast between cover types in comparison to the CIR 
imagery. With the RB-57 Imagery, however, texture provides 
more supplemental clues to Identification than is the case 
with S-190B Imagery from Skylab. 

An overall assessment of the S-190B Imagery for purposes 
of crop acreage assessment is that resolution is adequate for 
acreage measurement, but the natural color film does not pro- 
vide enough contrast in signatures between cover types to al- 
low detailed classification from Imagery obtained on a single 
date. Since the tests did show quite distinctive signatures 
for bare soil and senescent vegetation, temporal overlays are 
a good possibility for obtaining more detailed classifications. 
Winter wheat, for example, is green in late fall, green in spring, 
and becomes senescent In mid-summer. Fields devoted to other 
small grains would be bare soil during the planting season, 
green during the early growing season, and senescent in mid- 
summer. Corn and beans would both show a pattern of transi- 
tion from bare soil to green from spring to summer, but corn 
could be distinguished at the tassel stage. Since cloud-free 
S-190B imagery over the Ingham County test area vps only ob- 
tained for one date, there was no opportunity t irsue the 
question of temporal overlays in this project. 

Another means to obtain better detail of classification 
is through development of CIR film with a resolution equiva- 


lent to that of the SO-242 hlgh-reaoXutlon aerial color film. 
CIR film gives more variation In signatures between crop types 
and is less affected by atmospheric haze than conventional 
color film. 

Since time required to interpret the S-190B imagery In- 
fluences both cost of surveys and speed with which the results 
become available, this aspect must also be considered. After 
the interpretations discussed previously had been completed, 
the interpreter performed a time study for which the results 
are summarized in Table 14. 


Table 14. Results of time study for interpretation (Including 
acreage measurement) of S-190B Imagery. 


Field size 
acres 


No. of Avr. interpretation 

fields time per field (sec. 

58.4 


Std. Dev. 


33.4 


67.4 


29.2 


80.8 


51.1 


87.2 


37.0 


overall 


62.7 


34.1 


Time required for interpretation of a field increases 
somewhat with its size as expected, but this increase is not 
linear. In fact, other variables such as shape of field and 
contrast along the borders are as Important as field size in 
determining time required for Interpretation. Therefore, the 






overall time of 63 seconds/f ield with standard deviation of 
34 seconds provides a reasonable figure for cost calculations. 
Using the average field size In the test area of about 20 acres 
(Table 1), there are approximately 32 fields per standard sec- 
tion (640 acres). Thus, the expected time required to inter- 
pret a section is approximately 34 minutes; and about 20 hours 
would be required per township. It should be noted that these 
figures apply only to thoroughly trained and experienced in- 
terpreters. The actual time required in an operational set- 
ting would probably be somewhat longer because the interpre- 
ter would become progressively more tired If he/she interpre- 
ted for a full eight hour day instead of for the partial day 
spent on photo-interpretation in this study. 

Development of Signatures .Multispectral 


Scanner Data 

Signature extraction was performed on field center pixels 
to obtain pure signals. Field center pixels contained no 
boundary areas and exclude mixtures except for those classes, 
i.e. urban areas, that contain a more uniform, defined mix- 
ture. Each spectral signature consists of a mean vector and 
a covariance matrix calculated from selected SDOs. The pro- 
cedure employed to extract the recognition signature is des- 
cribed in this appendix. 

The test area included 90 sections which was divided into 


two portions, the northern portion containing 40 sections and 
the southern 50. The northern 40 sections were used as the 



training 

site 

• One objective 

was to 

study 

the relationship 

between 

recognition accuracy and the 

amount 

of information 

used for 

training. Therefore, 

three 

sets o 

f recognition sig 

natures 

were 

formed using 10 , 

20, and 

all 40 

sections for 

training 

. To 

select the 10 and 20 se 

etion 

subsets used, all 

40 secti 

ons were numbered and 

ranked 

according to a random 

number t 

able . 

Table 15 gives 

the res 

ult tng 

rankings. The 

first 10 

and 

20 sections were 

used to 

form 

the 10 and 20 

section 

signa 

ture sets. 




Tab 

le 15 

. Ranking of 40 

1 northe 

irn sec 

tions used to 



select 10 and 

20 sec 

: t i o n s 

ubsets . 


RANK 


SECTION 


SECTION RANK 


SECTION RANK 


SECTION 


Locke 11 
Leroy 5 
Locke 20 
Leroy 6 
Locke 29 
Leroy 4 
Leroy 11 
Locke 35 
Leroy 9 
Locke 19 


Locke 7 
Locke 22 
Locke 15 
Leroy 3 
Locke 6 
Locke 32 
Locke 34 
Locke 31 
Locke 8 
Locke 33 


Leroy 10 
Locke 16 
Locke 30 
Leroy 2 
Locke 21 
Locke 4 
Locke 9 
Locke 14 
Locke 2? 
Leroy 7 


Locke 18 
Locke 2 
Locke 23 
Locke 17 
Locke 3 
Locke 10 
Locke 5 
Locke 26 
Locke 28 
Leroy 8 


Signature extraction was completed by the use of a 
clustering algorithm implemented at ERIM.— ^ For the Skylab 


If Horowitz, H.M. , Lewis, J.T., Pentland, A.P. "Estimating 

Proportions of Objects from Multispectral Scanner Data," Report 
No. 109600-13-F, Environmental Research Institute of Michigan, 
May 1975. 
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S-192 data, a supervised form of clustering was used which 1 

j 

clustered field center pixels of each class or subclass in- j 

dependently. Thus, several distinct signatures were produced | 

for each class and no signature was contaminated with pixels j 

from other classes. Trees and brush were differentiated and 
clustered as separate subclasses. Also, the various forage 
subclasses were clustered as six different subclasses: grass, 
pasture, weeds, clover, stubble, and alfalfa. 

Clustering was performed on 12 of the SDOs, selecting 
one SDO from each detector. SDO 18 had many large anomalies 
which served to confuse the results and was omitted. The SDOs 
used for clustering were: 2, 4, 6, 8, 10, 12, 14, 17, 19, 20, 

21, 22, The clustering procedure created 24 signatures for 
the 40 section set, 19 for the 20 section set and 13 for the 
10 section set. The distribution of the signatures is given 
in Table 16. 

Since the cost of classifying is highly dependent upon 
the number of signatures and the number of bands of SDOs, the 
next step was to reduce the number of signatures. The three 
urban signatures were discarded because the urban area repre- 
sented only a small portion of the scene, was located in part 
of one section and because the main interest was in discrimi- 
nating among the agricultural ground covers. 

An expected performance matrix was generated for the 


linear rule classifier for the remaining signatures, and 
signatures were combined when they appeared redundant. Much 


Table 16. Number of cluster signatures created for 
each ground cover class of designation 
for the 40, 20, and 10 section training 


20 ! 10 

SIGNATURE LABEL I SECTION ! SECTION SECTION 


CORN 


BARE SOIL 


STUBBLE 


ALFALFA 


TREES 


BRUSH 


SOYBEAN 


GRASS 


PASTURE 


WEED 


CLOVER 


URBAN 


TOTAL 


of the redundancy was due to the fact that some ground cover 
Glasses were represented by more than one designation or label, 
and thus two cluster signatures were formed for what represented 
only one spectral class. Examination of the signatures indi- 
cated that the weed signature was highly correlated with one 
of the grass signatures, and a pasture signature was highly 
correlated with another grass signature. Some signatures were 


completely discarded if the expected performance matrix lndi- 












cated that pixels forming that particular cluster signature 
would be recognized by other signatures from the same ground 
cover class. This was true of the clover signature, a pasture 
signature, and the stubble signature given that stubble is 
spectrally similar to grass and should be recognized by sig- 
natures from that class. The final signature set consisted 
of 15 signatures including 4 corn, 2 tree, 1 brush, 1 alfalfa, 

1 soybean, 1 bare soil, and 5 grass. 

Although the cost of classifying is dependent upon the 
number of signatures, the reduction in number accomplished by 
the procedure described above could not be accomplished in an 
expedient manner when required. Therefore, the original clus- 
ter signatures were used as the final signatures for the 20 
and 10 section training. Thus, though fewer clusters signa- 
tures were formed for the 20 and 10 section training set, the 
final 20 section signature set had more signatures than the 
final 40 section set. 

To further reduce the cost of the classifier, the number 
of signal bands were reduced. The tradeoff involved here is 
that the fewer number of channels used the lower the cost of 
processing, while increased accuracy comes from using a great- 
er number of channels. First, the channels were ranked accord- 
ing to a criterion based on the average pairwise probability 
of misclassif ieation. The best band was selected, then the 
band which with the one chosen is best, etc. We calculated 
the theoretical probability of misclassif ieation (POM) as a 


function of the number of channels in the decision rule and 
chose the best n channels where the decrease in POM between 
using n and n + 1 channels became less than 0.005. This 
analysis indicated that SDOs 6, 19, and 20 provided little 
aid in discriminating between major ground cover types in 
this data set so they were excluded from further study. 

The SDOs used for each training set are given in Table 17. 


Table 17. SDOs used for the 40, 20, and 10 
section recognition signatures. 


Training Set 
40 Section 
20 Section 
10 Section 


SDOs 


Total No . of SDOs 


10, 

12, 

17, 

19, 

20 

7 

10, 

12, 

14, 

17, 

19, 20 

8 

12 

» 14 

, 17 

» 19 


6 


Root mean square (RMS) errors were calculated to evaluate 
the performance of proportion estimation in both the northern 
and southern portions of the test site. Since recognition 
results were calculated section-by-section and then aggregated 
by class for the test area under consideration, RMS errors 
were calculated in two ways. One way is by recognition class 
aggregated over the test area; this is a measure of overall 
performance. The second was seetion-by-section for an indiv- 
idual class. 

RMS errors were calculated as follows: 


1 « o 

ERMS - ~ l (P. - P.) 



where 

“ ground truth proportion for one recognition class 
for the test area (or for one section) 
p^ * estimated proportion for the sane recognition 
class for the test area (or the same section) 

N - number of recognition classes (or number of sections) 
considered . 

When N represents recognition classes* there usually is 
no qualifying decriptlon added to "RMS error" in the main body 
of text. When N represents sections, the RMS error is identi- 
fied as "section-by-section RMS error." 


Recognition Results of Ap.pl 


Signatures to S-19.2 Data 


The results are developed as two subsets. One is for the 
northern area where the data was used to obtain the signatures. 
The other is for the remaining or southern area which is contlg 
uous with the northern area. Recognition results for the north 
ern 40 sections will be referred to as local recognition, re- 
gardless of the number of sections used for training, since 
the signatures are based on information from the area even if 
not from every section. In all cases, the ERIM linear de- 
cision rule— ^ was used with a threshold corresponding to an 
infinitesimal probability of false rejection of signals from 


Crane, R. , W. Richardson, R. Rieber, and W.A. Malila, 
"A Study of Techniques for Processing Multispectral 
Scanner Data," Report No. 31650-155-T, Environmental 
Research Institute of Michigan, January 1973. 


the a sensed multivariate normal distributions. As discussed 
in the preceding section, a number of recognition signatures 
were obtained for each recognition class through the use of 
a supervised clustering procedure. 

To obtain pure signals for various ground cover classes, 
field center pixels were defined so as to contain no boundary 
elements and in general exclude mixtures except for certain 
classes which contain more uniform, defined mixtures, l.e. 
urban areas. Results are first reported for recognition of 
only field center pixels since discrimination of ground covers 
will be optimum if the signals are from pure ground cover 
classes, instead of composites of several ground cover types. 
Results for recognition of whole areas, including boundary 
pixels, are reported later. 

Tables 18 - 20 present performance matrices obtained for 
field center pixels using 40, 20, and 10 sections for training 
Each matrix indicates both the number of pixels in each ground 
truth class and how these pixels were apportioned among the 
various recognition classes by the decision algorithm. At 
the bottom of each table are the percentage of total pixels 
recognized by each recognition signature and the percentage 
of field center pixels belonging to each signature class ac- 
cording to the ground truth. The major ground truth classes 
are corn and grass, each with a third or more of the field 
center pixels. The recognition class "forage" included grass, 
alfalfa, and/or stubble signatures. 



Table 18. Perf ormauce matrix for classification of field center pixels 
from north 40 sections using 40 sections for training. 


Ground truth 
class 


Forage 

(Grass 398) 
(Alfalfa 23) 
(Stubble 53) 

Tree/Brush 
(Trees 24) 
(Brush 68) 

Bare soil 

Soybean 

Urban 


Percent of field center pixels assigned to recognition class: 
No. 

Pixel Corn Forage Tree/Brush Bare soil Soybean 


Total 1036 

Excluding urban 9 6 7 


Cor n 

73.0 

8.9 
(7 
(21 
(15 


35.5 

33.9 


(16.2) 


Ground truth ( X ) 
Excluding urban 


33.2 

35.6 


RMS error in proportion estimation (X) * 3.12 

2.57 (Excluding urban) 

Overall percent correct classification of pixels = 70. OX 

7 5. OX (Excluding urban) 






Table 19 


Performance matrix for classification of field center pixels 
from north 40 sections using 20 sections for training. 


Percent of 

field center pixels 

assigned to 

recognition 

class : 

Ground truth 

No. 






class 

pixels 

Corn 

Forage 

Tree/Brush 

Bare soil Unclassified 

Corn 

344 

83.7 

10.8 

3.4 

0.9 

1.2 

Forage 

474 

8.6 

86.3 

1.7 

1.5 

1.9 

(Grass 398) 


(7.5) 

(88.5) 

(2.0) 

(0.5) 

(1.5) 

(Alfalfa 23) 


(34.8) 

(60.9) 

(0.0) 

(0.0) 

(4.4) 

(Stubble 53) 


(5.7) 

(81.2) 

(0.0) 

(9.4) 

(3.8) 

Tree/Brush 

92 

35.9 

14.1 

45.7 

0*0 

4.3 

(Trees 24) 


(25.0) 

(12.5) 

(62.5) 

(0.0) 

(0.0) 

(Brush 68) 


(39.7) 

(14.7) 

(39.7) 

(0.0) 

(5.9) 

Bare soil 

38 

0.0 

5.3 

0.0 

86.8 

7.9 

Soybean 

19 

84.2 

5.3 

10.5 

0.0 

0.0 

Urban 

69 

44.9 

31.9 

0.0 

14.5 

8.7 

Total 

1036 

39.5 

46.7 

6.2 

5.1 

2.5 

Excluding urban 

967 

39.1 

47.8 

6.6 

4.4 

2.1 


Ground truth (%) 


33.2 

45.8 

8.9 

3.7 

8.5 

Excluding urban 


35.6 

49.0 

9.5 

3.9 

2.0 


RMS error In proportion estimation (Z) “ 4.14 

2-11 (Excluding urban) 

Overall percent correct classification of pixels « 75 .11 

79. 8Z (Excluding urban) 


» 




Table 20. Performance matrix for classification of field center pixels 
from north 40 sections using 10 sections for training. 
Percent of field center pixels assigned to recognition class: 
d truth No . 


Ground truth 
class 


Forage 

(Grass 398) 
(Alfalfa 23) 
(Stubble 53) 

Tree/Brush 
(Trees 24) 
(Brush 68) 

Bare soil 

Soybean 

Urban 


Total 1 ' 

Excluding urban i. ./ 


pixe Is 

Corn 

Forage 

Tree 

344 

82.9 

10.8 

2.9 

474 

16.0 

(15.3) 

(43.5) 

(9.4) 

78.3 

(80.7) 

(47.8) 
(73.6) 

2.3 

( 

( 

( 

92 

64.1 

(83.3) 

(57.4) 

10.9 

(12.5) 

(10.3) 

16.3 

( 

(2 

38 

0.0 

52.6 

0.0 

19 

79.0 

21.1 

0.0 

ff' 

15.9 

62.3 

0.0 


14.5 


43.1 

45.0 


46.8 

45.7 



Ground truth (%) 
Excluding urban 


33.2 

35.6 


45.8 

49.0 


RMS error in proportion estimation (Z) * 5.65 

5.18 (Excluding urban) 

Overall percent correct classification of pixels “ 67. 0Z 

71. 3Z (Excluding urban) 
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Examinatlon of Table 18 reveals that correct recognition 
with 40 training sections ranges from 83.7 percent for grass 
to 42.6 percent for brush, with 73.0 percent for corn and an 
overall average of 70.0 percent. The urban class did not 
have a specific recognition signature and was recognized by 
forage, bare soil and corn signatures. Bare soil and grass 
would be expected in an urban scene, but the corn detections 
must represent mixture pixels, such as mixtures of trees or 
shrubs and grass. Many of the pixels recognized as bare soil 
probably represent concrete and buildings. Since exclusion 
of urban areas is a common procedure in agricultural applica- 
tions, the totals in Table 18 exclude the urban pixels. The 
performance matrix indicates that 69.8 percent of the stubble 
pixels were recognized as forage, with most of the remainder 
split between bare soil and corn. That some stubble would 
be recognized as bare soil is not surprising since newly mowed 
stubble would contain much exposed soil. Many missed detec- 
tions of brush were due to the corn signature, with as many 
brush pixels being incorrectly assigned to corn as were cor- 
rectly recognized as tree/brush. Corn missed detections 
tended to be assigned to tree/brush signatures with some pixels 
misclassif led as grass. Missed detections for the remaining 
ground cover classes were largely due to the corn and forage 
signatures. 

Ground truth and recognition percentages have been cal- 
culated for two cases. First, urban areas were included and 
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con, s ider ed to belong to the unclassified category and then 
the percentages were recalculated excluding the urban areas 
and urban pixels. A comparison of the recognition and ground 
truth percentages for the different ground truth classes shows 
close agreement for all but urban (unclassified), even for 
tree/brush which had only 42.6 percent correct recognition of 
brush field center pixels. The root mean square (RMS) error 
in overall proportion estimates in only 2.6 percent, excluding 
urban areas. Thus, compensating errors produced accurate es- 
timates of the signature-class proportions of field center 
pixels. Proportion estimates for whole areas, including 
boundary as well as field center pixels, are discussed below. 

When 20 sections were used for training, the signatures 
were not formed in the same manner as for the 40 section train- 
ing. This was explained in the preceding section as was the 
fact that a different number of S-192 channels was used for 
recognition. Although different numbers and types of signa- 
tures were obtained, a constant set of recognition classes 
was maintained except for soybeans which did not have enough 
field center pixels in either the 20 or 10 sections to form 
a signature. Also, there were not enough pixels for a speci- 
fic alfalfa signature for the "forage" recognition class in 
any but the 40 section case. 

Table 19 is the performance matrix for recognition over 
the northern 40 sections using the 20 section signatures. 
Correct recognition ranges from 88.5 percent for grass to 


39.7 percent for brush, with 83.7 percent for corn, 86.8 per- 
cent for bare soil, and an overall average of 75.1 percent. 
Bare soil recognition accuracy improved substantially, with 
none of the bare soil being misclassif led as corn as com- 
pared to 13.2 percent with the 4Q section training. However, 
tree recognition accuracy reduced by 13 percentage points. 

For the 40 section training, forage signatures were respon- 
sible for meny of the missed tree detections (20.8 percent), 
but with the 20 section training, corn was responsible for 
twice as many missed detections as forage, 25 percent versus 
12.5 percent. Although the percent of brush correctly recog- 
nized is the same as for the 40 section training, the number 
of pixels misclassif led as corn Increased. On the other hand, 
corn recognition accuracy Improved by 10 percent. As a result 
the percentage of field center corn pixels was overestimated 
by 4 to 6 percent. 

Since there were no signatures for soybeans or urban, 
these two ground covers could not be correctly recognized. 
Urban pixels were classified much like they were with the 40 
section signature, while 84.2 percent of the soybeans were 
recognized as the corn signatures. 

As with the 40 section training signature set, there is 
a fairly close agreement between total recognition and ground 
truth percentages for the various recognition classes. The 
estimation is especially close when the urban pixels are 
omitted from the calculations. 



When 10 sections were used for training, there also were 
insufficient field center pixels for a specific signature for 
dense tree stands. Table 20 gives the performance matrix for 
the classification of the northern 40 sections with the 10 
section signature set. Correct recognition ranges from 82.9 
percent for corn to 22.1 percent for brush and 0.0 percent 
for trees, with an average of 65.3 percent. Brush recogni- 
tion accuracy is greatly reduced compared to either the 40 
or 20 section results. Two and a half times as many brush 
pixels were misclassif led as corn as were correctly recog- 
nized as brush. Bare soil recognition accuracy decreased 
greatly, to 47.4 percent, with more pixels being recognized 
as '.orage than as bare soil. Grass was 80.7 percent recog- 
nized with 15 percent being misclassif led as corn. Corn 
recognition also «a» high. Stubble was not ar well recognized 
(73.6 percent) as it was with the 20 section training set; 
most missed detections were due to the bare soil (9.4 percent) 
and corn (9.4 percent) signatures, A majority of urban pixels 
were recognized by forage signatures. J 

An examination of the recognition and ground truth per- 
centages for the ground truth classes shows that the corn 
estimates is high by approximately 10 percentage points for 
these 10 section signatures. The tree/brush estimate is less 
than half the amount of trees and brush present according to J 

the ground truth, and the RMS error of the estimates is greater 1 

than for the other signature sets. J 

j I 


Figure 10 gives the percentage of correct recognition 
for the four ground truth classes which had specific subclass 
signatures in all three signature sets. The 20 section results 
are slightly better than the 40 section results for all clas- 
ses. It would be expected that the 20 section training set 
should give reduced recognition results since there was less 
information available for training. However, different train- 
ing procedures were used and some of the original 40 section 
signatures extracted by clustering were omitted and others 
were combined for the final signature set. All of the origi- 
nal 20 section cluster signatures were used for classification, 
so in this respect the 20 section set contained more Informa- 
tion than the 40 section signature set. Also, one additional 
channel was used for recognition with the 20 section signa- 
tures. The 10 section training procedure was the same as for 
20 sections, and the 10 section resultB are always poorer whei 
compared to the 20 section results. In the cases of bare soil 
and brush, the decrease is large. 

Table 21 gives the root mean square (RMS) error for the 
percentage estimation of field center pixels over all sections 
using signatures bases on 40, 20, and 10 sections. Errors were 
calculated both with and without the urban pixels. The RMS 
errors show that when all signature classes are considered 
the RMS error in field center proportion estimates increases 
with a decrease in the number rf sections used for training. 


Although field center pixel recognition results indicate 




Table 21. RMS errors In proportion estimation of 
field-center pixels over the northern 
40 sections. 


# TRAINING 
SECTIONS 

40 

20 

10 

- 

RMSE (%) 

With 

Urban 

3.11 

4.36 

5.65 

RMSE (%) 

Without 

Urban 

2.57 

2.11 

5.18 



- - 



the upper limit of recognition performance, the accuracy of 
acreage estimates provides a better measure of the usefulness 
for the prupose of crop acreage estimrtion. Acreage estima- 
tes can be calculated by tallying the recognition results 
for all pixels, section-by-sectlon, over all 40 sections. 

When all pixels are tallied, boundary pixels and other pixels 
which are mixtures are included in the tabulations. The re- 
sults are evaluated here by calculating the root mean square 
(RMS) error for each section and for each signature class. 

An overall RMS error by class also was calculated. 

Table 22 displays the aggregated recognition results over 
all pixels in the 40 section test site. The proportion esti- 
mates (expressed as percentages) of the six ground cover clas- 
ses are compared to the ground truth proportions, and overall 
RMS errors arc presented. The RMS error is seen to be inverse 
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ly proportional to the number of sections used for training 
comparing Table 22 to Tables 18 f 19, and 20 makes it clear 
that the error in estimating the proportions of all pixels 
in the test site is considerably higher than the error in 
estimating the proportions of the field center pixels. 

Table 22. Percentages of 6 ground cover classes 
and recognition percentages over 40 
northern sections using signatures 
from 40, 20, and 10 sections. 

Training Date 

40 20 10 


round Cover 

Ground Truth 

Section 

Section 

Sec t ion 

Corn 

26.5 

36.8 

41.2 

46.4 

Tree/Brush 

17.2 

14.3 

7.4 

2 . 7 

Forage 

CN 

sr 

40.5 

43.7 

42.5 

Bare Soil 

7 2 

5.4 

4.0 

4.8 

Soybean 

3.7 

2.4 

0.0 

0,0 

Other 

3.1 

0.4 

3.7 

3 . 7 

RMS Error 


4.661 

6.352 

10.103 


Another observation that can be made about the data of 
Table 22 is that the proportion of corn was overestimated in 
each irstance and got progressively worse as less training 
was used. On the other hand, the proportion of forage (the 
other major ground cover class) was accurately estimated and 
was not dependent on the training set used. Trees and brush 


were always underestimated and became more so as less data 
were used for train#~.g. 

Table 23 gives v .ie RMS errors calculated over sections 
for each of five ground cover classes. Here as in Table 22, 
the RMS error increases as the number of sections used for 
training decreases. This trend is also found within each 
ground cover class, with two exceptions. First, the RMS 
error for the bare soil class is slightly larger for the 20 
section training set than for 10 sections. Second, the RMS 
error for "other" decreases slightly as the number of train- 
ing sections decreases. 


Table 23. Section-by-section RMS error calculated 
per ground cover class for 40, 20, and 
10 section training sets. 


Training 


40 

Sec 

20 

Sec 

10 

Sec 


Corn Tree/Brush Bare Soil Forage Other 


4.363 j 9.679 8.492 
5.262 10.542 7.942 
4.927 11.976 7.604 


13.776 

S 7.418 

17.198 

11.918 

i 

23.108 i 

16.927 


Since it is very costly in time and resources to collect 
ground truth and accurately identify fields for training, it 
is desirable that training signatures from one area be appli- 
cable to adjacent areas. Such use of signatures for recogni- 
tion In areas other than where they are formed is termed non- 
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local recognition. Forty-eight — sections from the southern 
portion of the test area were used to test the accuracy of 
nonlocal recognition using unadjusted signatures from the 
nothern portion. A tenfold increase in the number of tree 
pixels is the major compositional difference between the south 
and the north portions. In addition, field beans are present 
in the south but not in the north, while urban areas are mis- 
sing from the south. 

Table 24 gives the performance matrix obtained for field 
center pixels from the southern portion of the test area using 
the 40 section signatures from the northern portion. Recogni- 
tion accuracy ranges from 76.1 percent for corn to 0.0 percent 
for field beans (a crop not present in the training area) , wit 
an overall 63 percent correct classification of the field cen- 
ter pixels. A total of 23.9 percent of the forage, 67.9 per- 
cent of field beans, and 31.5 percent of trees and brush were 
misclassif led as corn. Trees were 55.8 percent correctly re- 
cognized, while soybeans tended to be recognized as forage. 
Most missed detection? of bare soil were misclassif led as 
forage ( 30.2 percent). 

The root mean square (RMS) error for nonlocal proportion 
estimation of field center pixels from the southern portion 

1/ Two other- sections in the southern portion are not use- 
able because they are completely cloud-covered on the 
U-2 acquired imagery which were used to generate the 
field identifications in this area. 


Table 24. Performance matrix for classification of field center pixels 

from south 48 sections using signatures from north 40 sections. 

Percent of field center pixels assigned to recognition class: 


Ground truth 
class 

No. 

pixels 

Corn 

Forage 

Tree/Brush 

Bare soil 

Soybean 

Unclassified 

Corn 

549 

76.1 

3.0 

14.0 

0.0 

1.8 

0.0 

Forage 

(Grass 264) 
(Alfalfa 20) 
(Stubble 71) 

355 

23.9 

(21.6) 

(80.0) 

(16.9) 

68.7 

(74.3) 

(20.0) 

(62.0) 

2.0 

(2.7) 

(0.0) 

(0.0) 

3.9 

(0.0) 

(0.0) 

(19.7) 

29.0 

(1.5) 

(0.0) 

(1.4) 

0.0 

(0.0) 

(0.0) 

(0.0) 

Tree/Brush 
(Tree 269) 
(Brush 39) 

308 

31.5 

(32.7) 

(23.1) 

12.3 

(8.6) 

(38.5) 

SI. 9 
(55.8) 
(25.6) 

0.0 

(0.0) 

(0.0) 

2.6 

(l.D 

(12.8) 

1.6 

(1.9) 

(0.0) 

Bare soil 

43 

4.7 

30.2 

2.3 

62.8 

0.0 

0.0 

Soybean 

52 

15.4 

65.4 

0.0 

0.0 

19.2 

0.0 

Field bean 

56 

67.9 

28.6 

3.6 

0.0 

0.0 

0.0 

Total 

1363 

47.5 

28.6 

18.1 

3.0 

2.4 

0.4 

— 

Ground truth (%) 


40.3 

26.0 

22.6 

3.2 

3.8 

4.1 


PMS error In proportion estimation (%) 13 3.97 

Overall percent correct classification of pixels = 63.0% 
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of the test site is 3*97 percent, 1-3 times larger than the 
error for local recognition in the north* Comparison of the 
ground truth percentages to the total field center recognition 
percentages (Table 24) shows the largest error was an over- 
estimation of corn pixels, with underestimates of trees/brush 
and field beans (unclassified). 

The signature set formed using all 40 sections in the 
northern portion of the test site also was used to estimate 
proportions of the ground cover classes over the entire south- 
ern portion of the test site, including nonfield-center pix- 
els. In Table 25, the proportion estimates are compared to 
the ground truth proportions, and with results obtained local- 
ly in the north. The RMS error is considerably higher for 
the nonlocal recognition. Examination of the estimates for 
each ground cover class show that the major discrepancy in 
the south 48 sections is an even larger overestimate for corn 
that was obtained for the north 40 sections* Proportions for 
most other ground covers were underestimated. 

Table 26 displays the section-by-section RMS error for 
each ground cover class for the recognition over both the 
northern and southern portions of the data. As is expected, 
the errors are higher in the southern portion of most ground 
cover classes; the only exception is tree/brush. The larg- 
est differences in errors for the two areas are for corn and 
bare soil* 

Results for proportional area estimation over the entire 


Table 25 


Ground truth proportions and recognition 
estimates for local (north 40) and non- 




North 40 sections South 48 sections 


Ground cover class 

Ground truth 

Recognition 

results 

Ground truth 

Recognition 

results 

Com 

26.5 

36.8 

33.3 

48.0 

Trees/Brush 

17.2 

14.3 

16.5 

13.3 

Forage 

47.4 

40.5 

35.5 

30.9 

Bare soil 

7.2 

5.4 

7.2 

3.3 

Soybeans 

3.7 

2.4 

4.0 

4.4 

Other 

3.1 

0.4 

4.7 

0.0 

SMS error 

4.661 


6.891 
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northern portion of the test site showed that the RMS error 
of proportion estimates did increase substantially as lesser 
amounts of data were used for training. This was true both 
for errors in the estimated overall proportions in the test 
site and on a seetion-by-section basis within each crop type. 
The major overall error was an overestimate of the proportion 
of corn in the test site. The overestimate became larger as 
less training data was used, with the proportion of trees 
and brush being underestimated correspondingly. The section- 
by-section RMS errors for these two classes follow the same 
pattern. 

~ Table 26. Section-by-sectlon RMS error (in percent) 

for proportion estimation of north and 
south portions of test site (40 section 
signatures) . 


Site 

Corn 

Tree/Brush 

Soybeans 

Bare Soil 

Forage 

Other 

N 

13.776 

7.288 

4.983 

4.363 

9.679 

5.901 

S 

17.000 

7.225 

5.649 

7.374 

10.953 

7.798 


Interpretation of results for the classification of field- 
center pixels with the same test area is confounded by the fa-t 
that slightly different training procedures were with 40 sec- 
tions than with 20 and 10 and that different numbers of S-192 
spectral channels were used in the three cases. The overall 
correct percentage for 40 training sections is greater than 


that for 10 sections, but less than that for 20 sections. 
However, the relative ranking of percentages is directly re- 
lated to both the number of channels used and the number of 
recognition signatures used. Thus, the results obtained do 
not by themselves lead to a clearcut conclusion about the 
effects that the amount of data used for training might have 
on field-center classification performance, although one 
would expect, in general, that results would improve as the 
amount of training data was increased. 

Some discussion of the observed recognition results and 
possible reasons for them is in order. Corn was represented 
by two major and two minor clusters in each training set. 

There were enough corn pixels available so that the major 
clusters, at least, remained relatively constant from set to 
set. The observed differences in corn recognition then are 
more directly related to the other signatures that were de- 
veloped. Trees and brush were the major competing signatures 
and captured 18 percent of field-center corn pixels in the 
40 section case. This reduced to 3.4 percent in the 20 sec- 
tion case. The major difference in brush signatures was that 
a single large cluster overlapping the two major corn clusters 
was found in the 40 section case and two smaller clusters 
found for 20 sections. 

An examination of signa- .ire plots showed that the 40 sec- 
tion brush cluster substantially overlapped the major corn 
clusters. Evidently, the corn/brush decision boundaries were 
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shifted sufficiently, when the brush was represented by two 
clusters (20 section training), to cause a substantial in- 
crease in corn classification and decrease in brush classi- 


fication . 

Another result of the smaller number of pixels avail- 
able for training was that clusters contained fewer pixels 
and consequently were not as representative of ground cover 
classes in the test site. The extreme case was trees for 
which there were insufficient training pixels to form a re- 
cognition signature in the 10 section case. 

In conclusion, the crop recognition accuracy which was 
achieved during this investigation was shown to be related 
to the amount of data available for training the computer. 
Accuracy increased as 10, 20, and the 40 sections were made 
available for extracting training statistics via a supervised 
clustering approach. Even with 40 sections available for 
training, however, the average absolute accuracy of roughly 
70 percent for 5 classes was somewhat disappointing. These 
relatively low values were attributed to the facts that: 

1) the data were gathered at a non-optimum time in 
early August when corn, and other crops were quite 
variable in their state of maturity, 

2) the atmospheric conditions over the test site were 
were fairly hazy thereby reducing available con- 
tract and , 

3) the data gathered by the S-192 had some signifi- 
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cant dificlencles with regard to slgnal-to-noise 
ratio In some bands, the dynamic range covered by 
the signals* and channel-to-channel spatial regis- 
tration . 

In order to more fully address the question of crop sur- 
vey accuracy attainable with multlspectral scanners and auto 
matic processing systems, additional studies should be under- 
taken using data gathered with other sensors at other times 
and at other locations. 


Mixtures Processing to Improve Estimates from S-192 Data 


When a spatial resolution element overlaps the boundary 
between two or more ground classes, the radiation detected 
is a mixture from the classes involved. The spatial resolu- 
tion of the Skylab S-192 scanner is such, compared to the 
size of the fields or areas of the ground cover classes, that 
the frequency of mixture pixels is expected to be fairly large; 
The use of conventional multlspectral processing techniques on 
mixture pixels will likely result in an increased probability 
of improper classification. 

Conventional processing techniques rely on compensating 
errors to cancel the effects of misclasslf ications or on some 
fixed bias in the estimate which is measurable to produce ac- 
curate proportion estimates. The results reported above in- 
dicate that the errors do not always compensate. 
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ERIM has developed special processing techniques — ^ to 
handle such situations. These techniques recognize that pixels 
may contain mixtures of different ground covers and estimate 
the proportions of each class present in a pixel. 

The mixture algorithm first determines the most probable 

If 

single signature for a pixel and the attendant chi-square value — 
Next, the proportions of the most probable pair of classes and 
an associated chi-square value are calculated for the pixel. 

The pixel may be further analyzed as a mixture of three or 


four classes. For reasons of processing time and computer 
space requirements, consideration here was limited to either 
pure or two-class mixture pixels. This is not an unrealistic 
restriction when one considers that in an agricultural area 
like the current data set, most mixture pixels will occur at 


1/ Horwitz, H.M. , J.T. Lewis, and A.P. Pentland, "Estimating 
Proportions of Objects from Multispectral Scanner Data,” 
Report No, 1Q9600-13-F, Environmental Research Institute 
of Michigan, Ann Arbor, May 1975. 

Mai 11a, W.A. and R.F. Nalepka, "Atmospheric Effects in 
ERTS-1 Data and Advanced Information Extraction Techni- 
ques," Symposium on Significant Results Obtained from 
ERTS, Vol. 1, Goddard Space Plight Center, Greenbelt, 
Maryland, 1973. 

Horvitz, H.M. . R.F. Nalepka, P*D. Hyde, and J.P. Morgenatern, 
"Estimating Proportions ef Objects Within a Single Resolu- 
tion Element of a Multispectral Scanner," Seventh Inter- 
national Symposium on Remote Sensing of the Environment, 

May 1971. 

2/ The chi-square value is a measure of the likelihood that 
the pixel is a member of the signature distribution being 
considered. 
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field boundaries such that the vast majority of mixture pixels 
will be mixtures of two ground classes. 


Next, these chi-square values are compared to the values 

set for two parameters of the mixtures algorithm. If the chi- 

2 

square for the winning pure case is less than X^ , the pixel 


is considered to be pure. If it is not pure according to this 

test, the chi-square vailue for the two-class mixture case is 
2 

less than X ? , the pixel is determined to be the mixture in- 
dicate d. Otherwise, the pijfel is condidered to be "alien", 

i.e*, from a class or classes not luded in the signature 

2 2 , 

set. Currently X^ and X^ are cV f ^ empirically to minimize 
the error of the proportion estimate over some training area 
of known proportion. 


One factor affecting the performance of the mixtures 


processor is the geometrical configuration of the signatures 
used to define the ground cover classes . The signatures can 
be defined as hyper ellipses in an n^ dimensional orthogonal 
space where n is tue number of data bands or SDOs. A simplex 
is the hypervelume defined by thb m signature means. Pure 
pixels are those located hear signature means, while mixture 
pixels are located between signatures.' Further, if for a 
given set of signatures, a simplex they define is not convex, 
e.g. , one sigtiaturfe being a linear combination of two signa- 
tures, then the simplex is said to fe'e degenerate. For such 
a simplex a nonunique answer is mathematically possible and 


such simp lexis s should not be used for processing. _ 




The initial step in implementing the mixtures classifier 
is to define a signature set. It is important that the sig- 
natures used be sufficiently distant, one from the other, so 
that the simplex formed by the set of signatures will not 
be degenerate because the algorithm breaks down in that cir- 
cumstance. To keep the signatures far apart and to conserve 
processing time which increases as m(m + l)/2 (for m signa- 
tures) , the size of the signature set is kept as small as 
possible . 

The signature set for the forty section training consis- 
ted of 15 signatures with the following distribution; 

Corn 4 signatures 

Trees 2 signatures 

Brush 1 signature 

Grasses, weeds, etc. 5 signatures 

Bare soil 1 signature 

Soybeans 1 signature 

Alfalfa 1 signature 

Since soybeans and alfalfa are very minor ground covers 
in the test site, they were excluded from this study. An 
analysis of the tree and brush signatures showed the two tree 
signatures to be very disparate, but the brush was similar 
spectrally to one of the tree signatures with an overlapping 
of some 75 percent. The brush signature, representing pri- 
marily areas of scrub forest, was, therefore, combined with 
the tree signature. The bare soil signature was included in 
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be called a mixture ot grass ana Dare sou. 

The signature set described above vas applied to a small 
550 pixel section of the data. Subsequent analysis showed 
that very little of the data was being classified as grass 
and the error rate was substantial. The initial choice of 
a grass signature was apparently a poor one. Accordingly, 
a different grass signature was selected, this one being from 
the grass cluster with the greatest number of grass pixelB. 

The test data were again processed through the mixtures 





classifier. The results were somewhat better* but the total 
error in the proportion estimation for the test data was still 
slightly inferior to the error rate achieved using the con- 
ventional linear classifier. It was further noted that the 
chi-square thresholds chosen* which minimized the total error 
of the proportion estimate, resulted in 73 percent of the 
pixels being counted as "pure" and only 18 percent of the 
pixels being assessed as mixtures. Many more mixture pixels 
had been anticipated. 

One explanation for these results is that the conventional 
classification had been done using 15 signatures - the mixtures 
approach used only six. It seems that it would be necessary 
to further pack the signature simplex with other grass signa- 
tures to increase the grass classification rate. Such a pro- 
cedure would increase the grass classification and the accuracy 
of overall elasslf ication but it would further decrease the 
number of pixels processed as mixtures. 

Another reason why few pixels were called mixture pixels 
probably is the poor signal range of the data. Not only are 
the signature means relatively close together, but also the 
individual distributions are very broad. Pixels which are 
mixtures of separate classes may be very near the center of 
another distribution and may be classified as being from that 
distribution. Figure 11 illustrates the point showing 2-dlmen- 
sional ellipses which represent a boundary for a chi-square 
value of one for each of the distributions pictured. The 
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This pixel is a mixture of classes 
A and C and is not a member of B. 
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pixel Illustrated is a mixture of Classes A and C but will 
be classified as being a pure pixel of Class B. Because of 
these results, no further mixtures processing was performed 
on the agricultural test site data. 


S-192 Maps 

Figures 12 and 13 are color maps prepared from the S-192 
data. These maps are coded as follows: 


Green 


Corn 


Red and orange 


Forage 


Blue 


Trees and brush 


White 


Bare soil 


Black 


unclassified 


Figure 12 represents the northern 40 sections which contained 
the training set. Figure 13 is for the southern 40 sections. 

The odd, non-rectangular shapes are the result of having 
scan-line straightened the results prior to the generation 
of maps. The computer program did not allow for the yaw of 
Skylab or the effects of the earth's rotation beneath the 
spacecraft , 


Vigor and Yield Estimation 

The statement of work for the contract indicated the 
investigators would attempt to estimate crop yield or vigor 
from Skylab data. Given that the relevant data for such es- 
timates are available for only the August 5, 1973 pass, corn 
is the only feasible crop for such a study. Wheat had al- 





msm 

















" - Vvjt, si if 

WrJ 1% . /Vs jIFm. V ? 


v ,v 


























gaW g 


OTRODUCIBILITY OF THE 

ORIGINAL PAGE IS POOR 









Corn Corn Corn Alfalfa Trees Trees Brush Soybeans Bare Grass 2 Grass 1 Weeds Pasture Pasture Unclassi 

+ Grass fied 








ready been harvested. The acreage of soybeans was relatively 
small and grasses are in themselves so heterogeneous as to 
be inappropriate. 

However, as pointed out previously corn was in two dis- 
tinct states on thit date. Some of it was tasselled and some 
was not. These disparate states produced distinct signatures. 
Further complicating the situation were those fields that were 
partially tasselled. 

After carefully considering the poor signal quality of 
S-192 data, the resolution of the S-190B imagery, and the re- 
sults of acreage estimation reported above, it was clear that 
further attempts to discriminate for vigor would be fruitless. 
Determination of vigor requires good resolution in photographic 
imagery or a large signal range in S-192 data and, in addition, 
variation in vigor in ground truth. Such variations are more 
likely to occur over large test sites or among scattered sites. 
Technicians collecting ground truth for this study found simi- 
larity of vigor to be the rule rather than the exception. There-^ 
fore, future studies investigating Vigor might choose test 
sites so as to provide good contrast in vigor rather than to 
be representative of typical field size and cropping patterns. 

It is also the author’s opinion that vigor can not be translated 
into yield estimates until better agronomic models are avail- 
able to relate the canopy to yield. 

Resource Requirements for Multispectral . Automated Crop Surveys 


An important factor to be considered when judging the 




utility of machine processed multispectral scanner data for 
agricultural applications is the amount of resources required 
to extract the necessary information. The resources required, 
which include computer and personnel time as well as their 
costs, are discussed in this section. The multispectral data 
set upon which the estimates of necessary resources are based 
was collected by the S-192 multispectral scanner aboard the 
Skylab space station during an early August overflight of 
the southeast Michigan test site. The test site comprised 90 
etnviguous sections (each being approximately 640 acres) in 
Ingham County, Michigan. 

The primary question addressed in this section is how 
many resources are needed to carry out automated multispectral 
crop surveys. The secondary task was to examine costs and 
processing results as a function of the amount of training 
information used in that survey. To satisfy these needs, the 
training of the computer and classification of the data were 
carried out over the northern 40 sections of the test site 
with data from: (1) 40 sections, (2) 20 sections, and (3) 

10 sections being used for training and with the full 40 
sections being used for classification and evaluation in 
each case. In addition, the SO sections in the southern 
part of the test site were classified using the statistics 
generated by training on the northern 40 sections. Classic* 
flcatlon results were discussed previously. 

The overall processing flow is explained below. Briefly, 
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there were five stages for each test case: 


1) supervised clustering for generating signatures, 

2) selection of optimum band subsets, 

3) classification, 

4) evaluation of results for field center pixels, and 

5) evaluation of proportion estimation (from classifi- 
cation counts.) 

Outside of possible needs for data reformatting, data 
quality assessment, and location of training fields, the first 
step in the machine processing of multispectral scanner data 
is training, i.e., the generation of signatures for the com- 
puter which define the statistical characteristics of the 
ground classes of interest as seen in the data. There are 
many available means by which the signatures may be determined 
The supervised clustering method used here was judged by us 
to be appropriate for this investigation. 

When large numbers of spectral bands are available, the 
next step generally entails the selection of subsets of these 
bands which, based on the training data characteristics, will 
not result in a serious loss of classification accuracy. This 
step is employed to reduce the overall resource requirements 
since the use of additional spectral bands increases the com- 
puter time necessary to classify the data. Clearly, then, a 
simple discrimination problem with optimized high quality 
data will require fewer resources than a more difficult prob- 
lem with lower quality data. 





Following the selection of optimum spectral band subsets, 
the data are classified and the necessary information is ex- 
tracted. While it isn’t required for an operational survey 
system, for this investigation the results achieved during 
classification were fully evaluated and compared with ground 
observations. The evaluation was accomplished on both the 
classification accuracy of field center pixels (i.e., pixels 
clearly inside field boundaries each of which contains infor- 
mation on only one class) and the overall acreage or propor- 
tion estimation accuracy. 

Computer processing for this investigation was carried 
out at ERIM using an IBM 7094 Mul t ispec tral Processing System. 
(The processing system includes both hardware and software). 

The computer time reported below is given in terms of 7094 
execution time - care has been taken to eliminate the time 
spent in spinning the data tapes.- ^ 

Table 27 documents the 7094 CPU time to accomplish each 
of the five stages discussed above. Here we see that the CPU 
time required to establish training signatures via super- 
vised clustering is between 1.5 and 5 minutes, and the time 
is clearly and directly related to the amount of data being 

1/ The 7094, being a second generation computer, is tape 
oriented. All third and later generation machines - 
IBM 360, IBM 370, AMDAHL, Univac, etc., are disc oriented, 
multiprogramming machines which means that input/output 
(I/O) time is much faster and the user does not pay for 
the central processing unit (CPU) while performing I/O. 
Thus, the times reported are comparable to times involved 
on other machines. 
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Following the selection of optimum spectral band subsets, 
the data are classified and the necessary information is ex- 
tracted. While it isn’t required for an operational survey 
system, for this investigation the results achieved during 
classification were fully evaluated and compared with ground 
observations. The evaluation was accomplished on both the 
classification accuracy of field center pixels (i.e., pixels 
clearly inside field boundaries each of which contains infor- 
mation on only one class) and the overall acreage or propor- 
tion estimation accuracy. 

Computer processing for this investigation was carried 
out at ERIM using an IBM 7094 Multispectral Processing System. 
(The processing system includes both hardware and software). 
The computer time reported below is given in terms of 7094 
execution time — care has been taken to eliminate the time 
spent in spinning the data tapes .- • 

Table 27 documents the 7094 CPU time to accomplish each 
of the five stages discussed above. Here we see that the CPU 
time required to establish training signatures via super- 
vised clustering is between 1.5 and 5 minutes, and the time 
is clearly and directly related to the amount of data being 


1/ The 7094, being a second generation computer, is tape 
oriented. All third and later generation machines - 
IBM 360, IBM 370, AMDAHL, Univac, etc., are disc oriented, 
multiprogramming machines which means that input/output 
(I/O) time Is much faster and the user does not pay for 
the central processing unit (CPU) while performing I/O. 
Thus, the times reported are comparable to times involved 
on other machines. 


Table 27. Data processing machine times in terms of 
7094 CPU time with varying amounts of data 
used for training. 


North 40 section processing 
No. of sections used for training 


Step processing procedure 

40 

Sections 

20 

Sections 

10 

Sections 

South 50 section 
processing 

1. Supervised Clustering 

4*49.7" 

2*37.1" 

1*28. 8" 

— 

2. Selection of Optimal 
Signal Bands 

5*54.3" 

5*39.4" 

2*37. 3" 

— 

3. Classification of 
Data 

6*50.1" 

10*6.9" 

5*38.7" 

8*25.0" 

4. Evaluation of 

Classification for 
Field Center Pixels 

56.2" 

50.2" 

49.7" 

1*10.0" 

5 . Evaluation of 

Classification of 
Full Sections 

56.9" 

56.8" 

56.9' 

1*11.2" 


clustered. The situation for items 2 and 3 is not so straight- 
forward. For the selection of optimum bands, the necessary 
time did not increase uniformly as the number of sections 
available for training increased. This was at least partially 
becuase the number of signatures which resulted from the 10, 

20, and 40 sections did not increase uniformly with the add- 
ition of sections. A slightly different situation applies 
for the necessary classification time. Here the same amount 
of data was processed, however, as a result of the previous 
steps, the number of signatures, and spectral bands used for 
classifying the data were fewer when training on 40 sections 
than when training on 20 sections. The significant fact here 


is that it took between 6 and 10 minutes CPU time to process 
S-192 data covering 40 sections on the ground. The evaluation 
steps took roughly a minute each. For larger data sets one 
can estimate classification times by using the fact that the 
40 section classification in column one used the same signa- 
tures and spectral bands as the 50 section classification in 
the last column. In both cases roughly 1.5 to 2 minutes were 
■'■equired per 10 sections. 

As previously stated, all computer times provided In 
Table 27 were specified as the time required to execute the 
jobs on the ERIM 7094 Multispectral Processing System after 
adjustment to substantially exclude I/O time, thus, estimating 
CPU time to allow a more direct comparison between the 7094 
and other machines. In the remainder of this subsection we 
attempt to provide the information necessary to translate the 
execution time for these jobs to an equivalent time on other 
common computer systems. 

The only way to rigorously compare computation rates for 
various computers is by carefully controlled benchmark testing. 
Unfortunately, such controlled tests were not possible. What 
was done instead was to gather basic timing information on 
other computer systems. The basic information gathered includes 
cycle time, execution for an integer add, and execution time 
for an integer multiplication. These quantities give a gen- 
eral impression of relative processing times. However, a 
very accurate calculation of relative processing times is not 
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possible because of the different hardware peculiarities 
introduced on various machines to increase the speed with 
which several instructions may be sequentially executed. Also, 
differences in computer word size and differences in machine 
code generated for similar FORTRAN programs will contribute 
to differences in processing rates between machines as well. 
Table 28 lists the basic timing information for the computer 
systems considered. The entries in Table 29 are rough esti- 
mates of relative processing times for the same computer sys- 
tems . 


Table 28. Computational characteristics for some computers 

(all times given in nanoseconds) 


Computer 

Cycle time 

Integer add 
instruction 

Integer Multiply 
Instruction 

IBM 7094 

2,200. 

4,400 

15,400 

IBM 360/67 

750 

1,400 

4,800 

IBM 370/145 

608 

2,100 

20,100 

IBM 370/168 

80 

160 

400 

AMDAHL 470/V6 

32 

64 

256 

CDC 6500 

1,000 

600 

5,500 


* All numbers in this table came from IBM, AMDAHL, and CDC publica- 
tions describing the CPUs in question. 


A new special purpose multichannel data processing system 
not included in Table 29 is in the final stages of development 
and testing at ERIM. This system, the MIDAS (Multivariate 


Table 29. 


Relative processing times of selected 
computer systems . 


Computer 

Installation 

Operating 

system 

Relative processing time 

IBM 7094 

ERIM 

UMES 

•f* 

34.0 

IBM 360/67 
(Duplex) 

Univ. of Mich. 

UMMPS/MTS 

* 

9.0 

IBM 370/145 

— 

— 

30. 0 + 

IBM 370/168 

Univ. of Mich. 

UMMPS/MTS 

* 

1.5 

AMDAHL 470/V6 

Univ. of Mich. 

UMMPS/MTS 

1.0 

CDC 6500 

MSU 

SCOPE 

15.0*" 


* From preliminary benchmark tests performed at the University of 
Michigan, Ann Arbor, Michigan. 

+ Approximate values. 


Interactive Digital Analysis System) is expected to process 
data orders much faster than existing general purpose computer 
systems. For example, if provided in high density form, a 
7.2 x 10° pixel data set (a LANDSAT MSS frame) could be clas- 
sified in about 40 seconds on MIDAS. 

Multispectral remote sensing data connot be processed 
by the mere existence of a suitable computer processing system. 

A human as data analyst or parameter coder is necessary to set 
up individual computer jobs and interpret results. 

In a routine processing situation, the data flow is well 
defined as is the manner in which decisions regarding the pro- 
cessing of data are made. Therefore, the jobs for the personnel 


involved in processing the data are primarily "housekeeping" 
chores such as the management of the multispectral data and 
ancillary data, and the coding of parameters for and subse- 
quent running of individual computer jobs. 

In performing this study, we estimated those personnel 
resources which would be required once a routine had been 
established. Therefore, the data analyst time in Table 30 
is thought to be a good estimate of the time requirement for 
an operational system similar to the present ERIM processing 
system on the IBM 7094. Of course it is entirely possible 
to design an operational system to be more fully automated, 
combining several steps (e.g signature extraction and optimum 
band selection) and automating "housekeeping" chores, etc., 
so that personnel time requirements for some future system 
could be sharply reduced from those given in Table 30. 

The times in Table 30, within limits, are not a function 
of the amount of data in a data set, but are only a function 
of parameter selection and job set up. Thus, processing 900 
sections as a unit would require essentially the same person- 
nel time as processing 90 sections. 

In summary, the personnel time required for processing 
multispectral data is highly variable, depending upon the sys- 
tem design and the amount of active analyst intervention al- 
lowed. Also, the personnel time involved is not primarily 
a function of the size of the data set. 

Thus far we have described the necessary resources to 


Table 30. 


Personnel time required to prepare computer 
jobs for each step. 


No. of sections used for training 

40 20 10 South 50 sec- 

Step Processing procedure Sections Sections Sections tions processing 


1. Supervised clustering 30 min. 23 min. 19 min. 

2. Selection of optimal 

signal bands for 

training signatures 10 min. 10 min. 10 min. 

3. Classification of 

data by training 

signatures 10 min. 10 min. 10 min. 

4. Evaluation of 

classification for 

field center pixels 20 min. 20 min. 20 min. 


5. Evaluation of 

classificaiton of 

full sections 20 min. 20 min. 20 min. 


10 min. 


20 min. 


20 min. 


perform training, classification, and post-classification as- 
sessment. There are, however, additional costs involved. The 
first is the acquisition and assimilation of the ground truth 
information. The second is in the area of data preparation. 

The ground information used for this study was acquired 
by MSU personnel. The costs of ground truth acquisition is 
about directly proportional to the amount of training data 
necessary. That is, it would cost about half as much to ac- 
quire ground truth for 20 sections as it would for 40 sections 
Detailed records on ground truth costs were not compiled. 

For this study, a semiautomated technique was used to 
locate the scan line and scan point coordinates of the ground 





truthed fields and areas of interest. The stages in this 
process and attendant costs are given in Table 31. The entries 
in the table are for the full 90 sections; times for proces- 
sing fewer sections are directly proportional, except for 
basic setup costs. 


Table 31. Times associated with assimilation of 

ground truth information for 90 sections. 

Personnel 

Task Machine Machine time 


1. Acquisition of large scale Photographic 

photography laboratory 

2. Annotation of photography 


4 hours 


48 hours 


3. Digitization of coordinates X-Y digitizer 15 hours 


4. Regression for transformation 7094 


15 hours 


0.05 hours. 3 hours 


5. Transform digitized coor- 
dinates to data coordin- 


0.3 hours 


1 hour 


Data manipulation occurs between receipt of the data tapes 
and the training procedure. Included are such operations as 
reformatting or copying a subset of the delivered data tapes, 
or entering the data into a disc file data base. Also included 
is the checking of data quality via graymaps and histograms. 
Again it may be that, for an operational system, some of these 
steps may be ignored. In any event, the costs involved in 
any of these steps are related to the total amount of data 
processed and to the total amount of data used for training. 

The steps included in this category could take up to a full 



t 

1 
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days time. 

Conclusion 

Using the existing ERIM IBM 7094 Processing System, which 
has not been optimized for handling multispectral scanner data 
in an operational mode, the end-to-end time, to prepare the 
data, train the computer, and classify a large data set re- 
quiring the application of no special processing techniques 
would be three to five days. By optimizing this system or 
utilizing other existing computer systems one might be able 
to reduce the elapsed time. Elapsed time is important because 
of the need for timely estimates of crop acreage and/or vigor 
in any operational system. 

While other existing systems can compute faster than the 
IBM 7094, they may or may not be more economical. Per minute 
costs to use these systems are not the same. The faster ma- 
chines are usually more costly. So while, on the basis of 
speed alone, one system may be significantly better than another, 
the processing costs may be similar. 

It is obvious that people are still a major resource re- 
quirement, even for so-called automated crop surveys. Systems, 
such as ERIM’s MIDAS, which are designed to take into account 
the special characteristics of multispectral data and the 
special needs of people by providing interactive displays and 
data manipulation capabilities will certainly make future 
automated multispectral crop surveys more effective from the 
standpoint of both cost and time. Special processing techni- 
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1 / 

ques such as signature extension algorithms— are being 
developed which should reduce overall costs even more. 

Cost and Accuracy of SRS Procedures 

Current crop acreage estimates for the United States are 

produced by the Statistical Reporting Service (SRS) of the 

U. S. Department of Agriculture. These procedures are dis- 

2 / 

cussed in the U.S.D. A. Miscellaneous Publication 1308 — and 
earlier versions thereof. Basically, acreage estimation is 
a result of a probability sampling procedure using an area 
frame or a multiple-frame involving both an area frame and a 
list frame. This procedure is an evolution from the non-prob 
ability procedures previously used. 

Information was compiled on the cost-error relationships 
for SRS methods. Based on 1967 costs these relationships are 
depicted in Table 32. These data were calculated based on 
SRS and other information. 

On the basis of 1973 costs of a national probability sur- 
vey, a total probability survey costs $6.80 million while a 
survey of crop acreage only using a total probability survey 


1/ Henderson, R. G. , G. S. Thomas, and R. F. Nalepka, "Methods 
of Extending Signatures and Training Without Ground In- 
formation, " Report No. 109600-16-F, Environmental Re- 
search Institute of Michigan, Ann Arbor, May 1975. 

SRS, U. S, D. A. Scope and Methods of the Statistical 
Reporting Service , "U. S. D. A. Miscellaneous Publication 
1308,, U.S. Government Printing Office, 1975. 
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would cost $2.74 million. In other words, crop acreage 
estimation accounted for 40 percent of the cost. We there- 
fore used 40 percent of all item survey costs to calculate 
the 1967 data for crop acreage costs shown in Table 32. 


Table 32. Cost - Error, Relationships for SRS Methods. 

(data are costs in millions of 1967 dollars) 


Error levels 
(percent) 


Total probability 
sample surve - 


Crop acreage surve 


Area 

Multiple 

Area 

Multiple 

sample 

frame 

sample 

f rame 


sample 


sample 


0.0 

0.5 

62.00 

17.10 

44.20 

13.00 

24.80 

6.84 

17.68 

5.20 

1.0 

7.9 

0 

7.60 

3.1 

6 

3.0 

4 

1.5 

5.8 

0 

5.60 

2.3 

2 

2.2 

4 

2.0 

4.1 

3 

4.13 

1.6 

5 

1.6 

5 

2.5 

3.7 

6 

3.76 

1.5 

0 

1.5 

0 

3.0 

3.4 

0 

3.40 

1.3 

6 

1.3 

6 

3.5 

2.9 

0 

2.90 

1.1 

6 

1.1 

6 

4.0 

2.4 

0 

2.40 

0.9 

6 

0.9 

6 

4 . 5 

2.1 

5 

2.15 

0.8 

6 

0.8 

6 

5.0 

2.1 

0 

2.10 

0.8 

4 

0.8 

4 

5.5 

2.0 

0 

2.00 

0.8 

0 

0.8 

0 

6.0 

1.9 

0 

1.90 

0.7 

6 0.7 

6 


Inflation and other cost changes raised costs by an 
average of 9.3 percent per annum between 1967 and 1973. Th 
means the cost index with a base of 1967 was 165.1 in 1973. 
Extrapolating the 1967 figures accordingly by increases of 
65.1 percent yields the data shown in Table 33. 

In Table 34 the data have been rearranged to relate 
error levels to cost levels. These data relate to national 

Table 35 converts these to 


error levels for major crops. 
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Table 33. Error-Cost level relationships 

(Data are costs in millions of 1973 dollars). 


Error levels Total probability 

(percent) sample survey Crop acreage survey 



Area 

sample 

Multiple 

frame 

sample 

Area 

sample 

Multiple 

frame 

sample 

0.0 

102.30 

72.93 

41.17 

29.17 

0.5 

29.54 

22.46 

11.35 

8.98 

1.0 

12.54 

12.06 

5.25 

4.82 

1.5 

9.57 

9.24 

3.85 

3.70 

2.0 

6.80 

6.80 

2.74 

2.74 

2.5 

6.20 

6.20 

2.48 

2.48 

3.0 

5.61 

5.61 

2.24 

2.24 

3.5 

4.79 

4.79 

1.92 

1.92 

4.0 

3.96 

3.96 

1.58 

1.58 

4.5 

3.55 

3.55 

1.42 

1.42 

5.0 

3.47 

3.47 

1.39 

1.39 

5.5 

3.30 

3.30 

1.32 

1.32 

6.0 

3.14 

3.14 

1.26 

1.26 


Table 34. Crop acreage estimation: 
Costs (Millions 1973 dollars) 

Cost- Error relationship 
- Errors (percent) 

1.26 

6.0 

1.32 

5.5 

1.39 

5.0 

1.42 

4.5 

1.58 

4.0 

1.92 

3.5 

2.24 

3.0 

2.48 

2.5 

2.74 

2.0 

3.70 

1.5 

4.82 

1.0 

8.98 

0.5 

29.17 

0.0 


BE 



Table 35. Specified levels of typical sampling errors 

in 




major U*3* farm commodities. 






Percent 







Survey Cost 

(million dollars) 

Area sample 

3*40 3.76 

4.13 5.80 

7.90 

17.10 

62.00 

Multiple frame sample 3.40 3.76 

4.13 5.60 

7.60 

13.00 

44.20 


Typical sampling error 

in major commodities 

3.0 

2.5 

2.0 

1.5 

1.0 

0.5 

Individual commodities 

sampling error. b 



(percent) 



Wheat 


3.2 

2.6 

2.1 

1.6 

1.1 

0.7 

Rye 


9.0 

7.5 

5.9 

4.5 

3.0 

2.0 

Rice 


15.8 

12.6 

9.9 

7.8 

5.5 

3.5 

Corn 


2.1 

1.8 

1.4 

1.1 

0.8 

0.5 

Oats 


3.1 

2.6 

2.1 

1.7 

1.2 

0.7 

Barley 


5.4 

4.5 

3.5 

2.7 

1.9 

1.3 

Potatoes 


18.5 

15.5 

12.6 

9.5 

6.6 

4.2 

Soybeans 


3.4 

2.8 

2.2 

1.7 

1.2 

0.8 

Peanuts 


9.5 

8.0 

6.3 

5.0 

3.6 

2.2 

Tobacco 


5.1 

4.3 

3.4 

2.6 

1.8 

1.2 

Cotton 


4.8 

4.0 

3.1 

2.4 

1.7 

1.1 

Cattle 


2.3 

1.9 

1.3 

1.0 

0.7 

0.5 

Hogs 


4.4 

3.8 

2.9 

2.2 

1.6 

1.0 

Sheep & Lambs 


13.1 

11.0 

8.9 

6.8 

4.5 

3.0 

Poultry 


9.2 

7.8 

6.2 

4.8 

3.3 



itiajor commodities refer to items that are produced on most farms in the United States* 
h 

Sampling errors in the production characteristics of individual items corresponding to 
the specified levels of typical sampling error in major U.S. farm commodities. 

Sources Y. Hayami and W* Peterson, "Social Returns to Public Information Services," American 
Economic Review, LXII No* 1, March 1972* 


Table 36. Sampling errors for crop acreages by region, 1973* 


N . Cent . 


South 


Percent Sampling Error 


Corn 

4.0 

1.6 

3.1 

7.3 

Soybeans 

12.7 

2.2 

4.3 

- 

Rice ..*.**.. 

- 

- 

12.1 

14.7 

Sorghum Grain. . . . 

21.6 

4.6 

5.3 

9.2 

Wheat, Winter. . . . 

6.2 

3.6 

4.7 

3.4 

Wheat, Spring, . . . 

- 

4.5 

- 

9.6 

Oats 

6.2 

2.4 

6.1 

12.2 

Barley 

9.0 

5.0 

10.4 

4.9 

Cotton . 

- 

15.0 

3.4 

7.0 

Potatoes 

11.9 

20.2 

33.5 

12.0 

S' Beets 

- 

14.9 

- 

8.2 


Source: Information provided by SRS, USDA* 
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error levels for specific crops . Clearly the sampling error 
increases for crops produced on fewer farms. 

Table 36 relates specified sampling errors at the na- 
tional level the corresponding sampling errors at regional 
levels. This table shows the actual sampling errors in 1973 
and demonstrates that regional errors are larger than national 
errors and that regional errors are increased when the crop 
is less important in that region. It also illustrates the 
differences among crops even at the national level. 

The proportion of total costs attributed to crop surveys 
was 40 percent. This figure covers total crop survey costs 
including yield and acreage estimation. We have not estimated 
the marginal cost of crop surveys for acreage estimation 

only. However, it is likely that deleting only the acreage 
estimation from SRS activities would effect only a small savings. 
In order to provide information about livestock production, 
the enumerative surveys would need to be continued. Only a 
few questions would be deleted. There would, however, be a 
possible savings as a result of redesigning the sample to re- 
duce costs while obtaining the same accuracy with respect to 
livestock production. No specific dollar amount was estima- 
ted but we believe that it would be relatively small. 

Criteria for Crop Acreage Estimation by Remote Sensing 

A feasible crop acreage estimation procedure using remote 
sensing data of the type produced by Skylab must meet at least 


three criteria: 


1) Timeliness 

2) Secrecy until public release 

3) Cost-error relationship better than for SRS 

There could be some trade-off between these since a more timely 
survey is of greater value and therefore worth additional ex- 
penditure . 

Currently, SRS produces the acreage estimates for corn— ^ 
on approximately July 10 (exact date depends on the day of 
the week on which the 10th occurs.) The basis for that esti- 
mate is survey data collected in late June with, essentially, 
a July/cut-off for new information. Similarily, production 
estimates are released on the 10th of succeeding months based 
on information compiled essentially as of the first of the month 
Current SRS procedures involve use of the postal service to 
deliver state information to Washington D. C. for compilation. 

If sufficient value were attached to speed, the information 
could be transferred by other means (courier, telegraph, tele- 
phone, etc.). Since this is not done, there is an implicit 
judgment that the higher cost of transmitting by other means 
(or the increased risk of security leaks providing inside 
trading information) are larger than the value. 

Thus, a satellite-based system would need to provide 
comparable timeliness or significant cost saving. The first 
requisite is returning the data, or imagery, to earth soon 

1 / Corn is used as an example because it was the major crop 

in the test area. 




after the observations are made. Radio transmission may 
provide a quick return to eai;h* The analysis of S-192 data 
in this study indicates a 3 to 5 day processing cycle after 
the data is received. Thus, mul tispectral scanner techni- 
ques with computer processing can provide the timeliness 
needed* Photointerpretation time would be a function of the 
number of technicians available to work on the project when 
imagery is received. Generally, timeliness is more likely 
to be a probelem in a system based on photographic systems 
because the poor quality of telemetric transmission products 
requires actual landing of the film package before processing 
begins . 

Security is a problem in acreage estimation because 
"leaks” provide valuable information to persons trading in 
the commodity markets, both the current cash and the futures 
market*-^ Because of this any satellite based system must 
have security checks built in to aroid premature leaks. This 
does not appear to be a significant problem with satellite 
based systems provided the designers of the analytical system 
are aware of the potential problem and include safeguards to 
prevent such leaks from occuring* 

In the current SRS system the state reports are kept 
under guard and unopened until the day the report is re- 
leased* Once the state reports are opened all operations 
take place in a guarded room with locked window shades, 
no telephones, closed sewer system, etc. to insure se- 
curity* 



Cost is the final criterion of importance. It should 
be clear to the reader that because the Skylab system does 
not provide yield estimates, it is not competitive in cost 
with current SRS procedures. Resource requirements for pro- 
cessing S-192 data are specified in a previous section. In 
addition one would need to specify: 

1) Cost of S-192 data acquisition 

2) Cost of ground truth 

3) Sample design 

Estimates of the costs of S-192 data acquisition were 
not available even though requested from NASA. Unavaila- 
bility of such data is understandable because of the diffi- 
culty of allocating joint costs over a large number of pro- 
jects. 

The cost of ground truth is related to a number of vari- 
ables. In an operational system one could probably contract 
with farmers to provide information at low cost. In other 
words, a set of fields would be Identified in a variety of 
locations to be used for training sets. For a small fee many 
farmers would contract to provide the information if maps of 
fields were provided. The total cost would involve some in- 
itial overhead in establishing locations for training set 
fields, identifying the farmers, creating the maps, and estab 
lishing initial contracts. Operational costs are likely, how 
ever, to be quite small. 

Sample design is a major problem. Some comments on this 
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are offered in a following section* We have not, however, 
attempted a definitive sampling design because of the cost 
involved . 

Thus, total costs of an operational system have not been 
developed to compare with SRS costs. It is clear, however, 
that a system for acreage estimation in the U. S. at the pre- 
sent time requires refinement beyond that demonstrated in the 
Skylab system* 


r mparison of Remote Sensing Systems for Crop Survey Purposes 

The several investigators for the present project have 
collectively been involved with studies covering most of the 
unclassified remote sensing systems (except radar) as they 
pertain to crop surveys* It is of interest to draw some com- 
parisons between these systems with respect to utility for 
crop survey work* 

The first major distinction to be made with respect to 
sensor systems is between photographic cameras and multi- 
spectral scanners. Photographic camera systems typically 
produce high geometric detail with the primary product (film 
or print) being most readily adaptable to analysis by human 
interpreters. Film and print products are not adaptable to 
telemetry without secondary scanning which involves major loss 
of geometric information. Mul tispec tral scanner systems pro- 
duce high spectral detail with the magnetic tapes being readily 
adapted to computer processing. Data produced by mul tispec tral 


scanners can also be telemetered without appreciable loss of 
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information. Spectral detail can be increased with photo- 
graphic systems by manipulating film/filter combinations and 
using multiple lens systems such as the ERE P S-190A. The only 
real avenues to increasing geometric detail with multispectral 
scanners are to reduce the pixel size or to obtain data from 
several angles of view. 

The second major consideration is the vehicle used to 
carry the sensor and the altitude at which it operates. Con- 
ventional aircraft and satellites constitute the two major 
categories. Within the aircraft category, the higher the 
altitude of operation the more loss of geometric detail through 
reduced resolution. However, high altitudes do offer cost 
savings because fewer frames and flight miles are needed to 
cover an area. High flying altitudes also extend the area 
which can be covered in a single mission since fuel and other 
expendables along with sun angle windows normally limit the 
area that can be covered. After each mission the aircraft 
returns to its ground base, so there are no problems in trans- 
porting data from the vehicle back to the ground. Satellites 
operate at such high altitudes that resolution becomes very 
critical. The minimum requirement for resolution in crop sur- 
vey work is that the observational unit, normally the field, 
be distinguishable. Satellite altitudes offer the major ad- 
vantage of allowing synoptic coverage of large areas in a 
short period of time, and orbits can be adjusted to give re- 
peating coverage at regular intervals. The fact that satel- 


ORIGINAC PAGE IS 
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lites remain in orbit rather than returning to the ground 
introduces the problem of transporting data back to the 
ground in timely fashion. 

Conventional aircraft carrying either photographic cam- 
eras or scanners have been used for studies of agricultural 
crops for some time. When operated at sufficiently low al- 
titude, either type of sensor is capable of giving both the 
geometric and spectral detail needed to classify crops, measure 
acreage, and detect major crop stresses. For example, photo- 
interpretation of crop type and acreage measurement from CIR 
U-2 imagery at a scale of 1:130,000 was used frequently as 
pseudo ground truth in the present project. A study of clas- 
sification accuracy for interpretation of this imagery was 
conducted, with the results shown in Table 37. Normally in 
working with imagery of this type, the interpreter can classify 
the typical signatures without difficulty and ground checks 

Table 37. Classification accuracy for photointerpretation 
of CIR imagery with scale 1:130,000. 

Category Actual no. of fields No. classified correctly Percent accurac; 
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are necessary only for atypical fields- Imagery from con- 
ventional aircraft has not been used extensively for broad- 
area crop surveys, however, because of the cost and logisti- 
cal difficulties involved in obt lining new imagery for large 
areas each year at the needed times. 

Since synoptic coverage in short periods of time is one 
of the major advantages of remote sensing from satellites, 
potential for use in crop surveys was extensively studied 
under the Earth Resources Technology Satellite program (now 
LANDSAT) . The ERTS-1 satellite carried a 4-channel multi- 
spectral scanner for which the ground resolution element, 
pixel, was approximately one acre in size- The utility of 
ERTS-1 data for agricultural and forestry purposes was studied 
under a NASA sponsored cooperative research program between 
MSU and ERIM— . The one-acre resolution was found to be too 
coarse for many applications in agriculture and forestry, par- 
ticularly so when interpretation was done manually. Within 
fields and forest stands the geometric clues of size, shape, 
shadow pattern, texture, and association of features normally 
used by the human interpreter are lost, leaving spectral sig- 
nature as the primary clue for identification. Most human 
interpreters are not particularly adept at detecting tonal 
differences corresponding to 2 or 3 counts of the sensor, and 
comparison of bands is largely limited to the process of color 


1 J Wayne Myers, et, al. "Use of ERTS Data for a Multidiscip 

Unary Analysis of Michigan Resources,” MSU, Report pre- 
pared for Goddard Space Flight Center, November 1974. 




c omy •-> a i t i £ or color-additive viewing. 

It wt' • f und that forest stands of about 80 acres or 
. <• t>\ 1 be Identified and mapped consistently. Detection 

swti “7 stands was inconsistent and depended mainly upon 
t‘‘ s? degree of target to background contrast provided by border- 
ing types. Since forests were the most readily interpretable 
of the vegetation types occurring in Michigan, even more strin- 
gent limitations applied to manual interpretation of agricul- 
tural crops. The ability to interpret at this level provides 
useful information only where field sizes are large, as for 
example, in the Imperial Valley of California and portions of 
the midwest grain belt. Even with such large tracts, however, 
the existence of a well-developed crop calendar and repeated 
coverage within the growing season is needed for interpretive 
purposes. In areas characterized by smaller fields, the utility 
of manually interpreted ERTS-type imagery is limited to design- 
ing the ground phases of surveys and in detecting shifts of 
land away from agriculture. The telemetered nature of ERTS 
data makes it much more adaptable for computer analysis than 
for manual interpretation. Using multivariate techniques of 
pattern recognition, the full complement of spectral informa- 
tion can be extracted from the MSS data and utilized in clas- 
sification. Using these techniques, a classification accuracy 
on the order of 85 percent was achieved for pixels that were 
entirely within a cover type (Table 38) using ERTS-1 data. For 
large fields where most of the pixels are pure, an 85 percent 
accuracy level approach was needed for making automated 
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Table 

Actual 

condition 

38. 

Number 
of plots 

Summary of 

Number 
of points 

ERTS -1 

classification results*. 
Percent classified as: 

Corn 

Soybeans 

Trees 

Bare 

soil 

Senescent 

vegetation 

Corn 

32 

444 

75.5 

0.3 

7.1 

— 

16.8 

Soybeans 

7 

51 

— 

84.9 

— 

— 

6.7 

Trees 

5 

75 

11.8 

— 

88.2 

— 

— 

Bare soil 

5 

36 

— 

— 

— 

95.0 

5.0 

Senescent 

vegetation 

47 

258 

9.1 

4.5 

0.7 

7.8 

76.8 

Total 

96 

864 

96.4 

89.7 

96.0 

102.8 

105.3 


* Data from Eaton County, Michigan, August 25, 1972. Analysis performed 
at ERIM. 


surveys of crop acreage, provided the errors are random. The 
problems arise for areas where the tract size is small ( 5 to 
20 acres) as it is in many parts of Michigan. From earlier 
tables, for instance, it can be seen that 67 percent qf the 
fields and 33 percent of the acreage in the Ingham County test 
strip were included in fields of 20 acres or less. Table 39 
shows the results of an analysis performed at ERIM laboratories 
on average numbers and ranges of resolution elements, pixels, 
that fell within fields of various sizes in a sample of ERTS 
imagery. As is evident from Table 39, field sizes must ap- 
proach 20 acres before there is a high probability that even 
a few resolution elements will be entirely within the field 
so that they are not affected by mixed signatures with neigh- 
boring fields. In the present study a surprisingly large 


ID- 


Table 39. Average numbers and ranges of resolution A 
elements falling within various field sizes. 


Field size 


No. fields 


Avr. points 


Range of points 


5 - 9.9 A. 


10-14.9 A. 


15-19.9 A. 


20-29.9 A. 


30-49.9 A. 


10.20 


50 and above 


74.91 


Data from Eaton County, Michigan, August 25, 1972. Analysis 
performed at ERIM. 


number of pixels were pure and the attempt to improve ac- 
curacy through a mixtures processor was fruitless. This un- 
expected result may have occurred because of the poor S-192 
data quality rather than because of an error in estimating 
the expected number of mixed pixels for the field sizes in 
the test area. 

With completely pre-programmed computer analysis there 
is also difficulty in segregating estimates by small sampling 
units such as fields or sections in order to use them for 
calculating contributions to sampling error in random samplin 
designs. This difficulty can be circumvented by interactive 
computing in which an operator uses a cursor on a cathode 
ray display to select subsets of the data, but this requires 
sophisticated computational equipment. Another approach is 


FEPRODUC: r-.- ... 

oitiCs i ~ r - \ - ■ c t-i t> - " 1 ' 
- la Pt-'iJJ? 






to determine the corner coordinates for the sampling units 
and input these to preprogrammed search routines, but this 
also adds significantly to the cost of the analysis. 

Skylab S-190A photography interpreted manually has an 
effective resolution similar to or slightly better than com- 
puter processed ERTS data. The ability to discriminate crop 
types is much better with computer processed ERTS data than 
w* th manually interpreted S-190A data. Since the S-190A 
imagery was quite variable in quality and cannot be teleme- 
tered as can T-ISS data, S-190A appears to offer little in the 
way of increased utility for crop acreage estimation. The 
excellent resolution of the S-190B camera system constitutes 
a major advance over other sensor systems previously avail- 
able to the public for analysis of earth resources. The re- 
solution of S-190B is sufficient for rapidly locating indi- 
vidual fields or larger sampling units, delineating the bound- 
aries, and measuring the acreage. These operations are either 
difficult or not possible at all with the other sensors dis- 
cussed. Although the SO-242 natural color film did not prc- 
vide enough contrast in spectral signatures for accurate recog 
nition of individual crops, such as soybeans, on a single 
occasion, separation would be possible through temporal over- 
lays of coverage obtained on several dates. Furthermore, 
the S-190B imagery can be enlarged photographically without 
excessive loss of detail to produce a mapping base for sub- 
sequent interpretations. There is a problem in manning a 
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space station such as Skylab over the length of time neces- 
sary to obtain imagery at several dates in different parts 
of the country as well as in transporting the exposed film 
to the ground at frequent intervals so that it will not be 
out-of-date before it reaches the survey analysts. The only 
apparent solution to these problems is through development 
and routine use of a space shuttle. Development of high- 
resolution color-infrared film would reduce the need for 
multiple coverage since there is a high probability that 
most crops could be distinguished on a single date with such 
film. 

Perhaps the most desirable approach of all, if techni- 
cally feasible, would be development of a multi-spectral scan- 
ner capable of resolution equivalent to that of the S-190B 
camera system. With a scanner such as this, spectral detail 
could be fully utilized in automated computer processing and 
the data could be telemetered immediately to ground stations 
upon collection providing timely survey results. 

There seems to be little hope of assessing crop condi- 
tion as opposed to identification and acreage measurement with 
any satellite sensor likely to be available in the near future, 
since it would be necessary to detect small groups of plants 
in order to accomplish this to any extent. Assessment of 
condition, for example, is on the margin of capabilities for 
high-altitude CIR imagery and aircraft MSS data* Very large 
scale imagery in combination with ground visitation is re- 
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quired for the more detailed aspects of assessing crop con- 
dition. This is not to imply that large-scale stress con- 
ditions cannot be detected, but such conditions are already 
evident without the use of satellite sensors. 

Performance of the S-192 has been discussed earlier. 

In general, the accuracy of classification was not high and 
there were significant under and overestimation of acreages for 
some crops. Better resolution and a greater signal range in 
the data are needed if a feasible system is to be developed. 

Possible Configurations for Crop Surveys Incorporating 
the Use of Satellite Data 

One possible approach is to base crop acreage surveys 
primarily on satellite data from a further development of 
the S-192 type scanner supplemented by a network of ground 
surveyed plots that would serve for developing training sets 
and checking the accuracy of classification. This would re- 
quire substantially complete coverage of the sample areas 
on either 3 or 4 occasions during the year. A late fall pass 
would serve for assessing harvest and Identifying winter wheat. 
A pass immediately following the planting season would serve 
to separate perenniel forage crops from fields under current 
cultivation for row crops. A late summer pass after sene- 
scence of beans but prior to senescence of corn would serve 
to separate these two major row crops. There would be some 
indeterminism in such a 3-pass system between spring grains 
such as oats, rye, and barley and fallow fields. A fourth 


pass might be added in the middle of the growing season to 
remove this indeterminism. Given differences in crop phen- 
ology across the country, this sort of system would require 
a satellite station to be operated essentially year-round, 
with telemetric return of data in order to provide timely 
results. It must also be recognized that portions of a re- 
gion may be cloud covered at any given satellite pass, thus 
necessitating supplemental passes to fill the gaps. The 
existence of such complete coverage for a region would also 
serve as a good basis for defining sampling frames and set- 
ting up efficient probability sampling designs. 

Such complete reliance on satellite imagery, however, 
makes the survey subject to severe disruption from malfunc- 
tion in satellite systems or related logistical problems. 
Reasonably clear weather conditions on a regional scale 
are also assumed to occur at critical points in the growing 
season. Furthermore, the lag time to be expected between 
acquisition of data in the spacecraft and delivery to re- 
gional laboratories for anr; lysis will create a continual 
time pressure on the rale - . f survey results. Yet another 

problem will be lack of prov — :a for information on crop 
condition, which is a key component of the ability to pre- 
dict yields. In view of these potential difficulties, some 
diversification of data sources would be desirable. Existing 
information on major crop types, soils, and meteorology would 
provide the data needed to break each region into sampling 


ORIGINAL, PAGE 38 
OP POOR QUAEZCXJ 


strata on the basis of agricultural characteristics such as 
percent of area available for cultivation, distribution of 
field sizes, generalized crop mix, and productive potential. 

A multi-stage sampling system nested within these strata 
could be drawn up. A set of satellite imagery would be 
collected in early summer at the close of the planting sea- 
son. From this imagery, percentages of tillable acreage 
under cultivation for row crops and small grains could be 
determined on a sample of primary units in each strata. Each 
primary unit in this sample would be further subdivided into 
secondary sampling units, a sample of which would be flown 
periodically during the growing season with an inexpensive 
and readily available combination of small camera (35mm or 
70mm) and light aircraft (for example, Cessna 172), Inter- 
pretation of the small camera imagery coupled with acreage 
measurement from the satellite imagery would give percent 
of acreage under cultivation devoted to each major crop type 
and stage of maturity. The secondary units sampled would be 
further divided into tertiary units, a sample of which would 
be visited on the ground for assessment of crop condition 
and verification of interpretations from the small camera 
imagery. Estimates obtained from this type of integrated 
survey would allow yield prediction, give increased accuracy 
and be more timely than surveys based primarily on more fre- 
quent satellite coverage. Furthermore, the total cost of 
the integrated survey would probably be less if real costs 
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of obtaining the more frequent satellite coverage are con- 
sidered. Speculation on survey designs incorporating multi- 
spectral scanner data obtainei-from satellites assumes that 
the resolution of these sensors is improved by about a factor 
of two over that obtained with the LANDSAT systems. 



